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Preface 


These lecture notes are designed to accompany a course on “Computational Geometry” 
that we teach at the Department of Computer Science, ETH Zúrich, in every winter term 
since 2005. The course has evolved over the years and so have the notes, a first version of 
which was published in 2008. In the current setting, the course runs over 14 weeks, with 
three hours of lecture and two hours of exercises each week. In addition, there are three 
sets of graded homeworks which students have to hand in spread over the course. T'he 
target audience are third-year Bachelor or Master students of Mathematics or Computer 
Science. 

The selection of topics and their treatment is somewhat subjective, beyond what 
we consider essentials driven by the (diverse) research interests within our working 
group. There is a certain slant towards algorithms rather than data structures and a 
fair amount of influx from combinatorial geometry. The focus is on low-dimensional 
Euclidean space (mostly 2D), although we sometimes discuss possible extensions and/or 
remarkable differences when going to higher dimensions. At the end of each chapter 
there is a list of questions that we expect our students to be able to answer in the oral 
exam. 

Most parts of these notes have gone through a number of proof-readings, but expe- 
rience tells that there are always a few mistakes that escape detection. So in case you 
notice some problem, please let us know, regardless of whether it is a minor typo or 
punctuation error, a glitch in formulation, or a hole in an argument. This way the issue 
can be fixed for the next edition and future readers profit from your findings. 

We thank Tobias Christ, Anna Gundert, Gabriel Nivasch, Júlia Pap, Marek Sulovsky, 
May Szedlak, and Hemant Tyagi for pointing out errors in preceding versions. 
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Chapter 1 


Fundamentals 


1.1 Models of Computation 


When designing algorithms, one has to agree on a model of computation according to 
which these algorithms can be executed. There are various such models, but when it 
comes to geometry some are more convenient to work with than others. Even using 
very elementary geometric operations—such as taking the center of a circle defined by 
three points or computing the length of a given circular arc—the realms of rational 
and even algebraic numbers are quickly left behind. Representing the resulting real 
numbers/coordinates would be a rather painful task in, for instance, a Turing machine 
type model of computation. 

Therefore, other models of computation are more prominent in the area of geometric 
algorithms and data structures. In this course we will be mostly concerned with two 
models: the Real RAM and the algebraic computation/decision tree model. The 
former is rather convenient when designing algorithms, because it sort of abstracts from 
the aforementioned representation issues by simply assuming that it can be done. The 
latter model typically appears in the context of lower bounds, that is, proofs that certain 
problems cannot be solved more efficiently than some function depending on the problem 
size (and possibly some other parameters). 

So let us see what these models are in more detail. 


Real RAM Model. A memory cell stores a real number (that is what the “Real” stands 
forf] Any single arithmetic operation (addition, subtraction, multiplication, division, 
and k-th root, for small constant k) or comparison can be computed in constant time 
This is a quite powerful (and somewhat unrealistic) model of computation, as a single 
real number in principle can encode an arbitrary amount of information. Therefore we 


IRAM stands for random access machine, meaning that every memory cell can be accessed in constant 
time. Not like, say, a list where one always has to start from the first element. 

In addition, sometimes also logarithms, other analytic functions, indirect addressing (integral), or floor 
and ceiling are used. As adding some of these operations makes the model more powerful, it is usually 
specified and emphasized explicitly when an algorithm uses them. 
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have to ensure that we do not abuse the power of this model. For instance, we may want 
to restrict the numbers that are manipulated by any single arithmetic operation to be 
bounded by some fixed polynomial in the numbers that appear in the input. 

On the positive side, the real RAM model allows to abstract from the lowlands of 
numeric and algebraic computation and to concentrate on the algorithmic core from a 
combinatorial point of view. 

But there are also downsides to using such a powerful model. In particular, it may 
be a challenge to efficiently implement a geometric algorithm designed for the real RAM 
on an actual computer. With bounded memory there is no way to represent general 
real numbers explicitly, and operations using a symbolic representation can hardly be 
considered constant time. 

When interested in lower bounds, it is convenient to use a model of computation that 
encompasses and represents explicitly all possible execution paths of an algorithm. This 
is what the following model is about. 


Algebraic Computation Trees (Ben-Or [1]). A computation is regarded as a binary tree. 


e The leaves contain the (possible) results of the compu- 


tation. a 


e Every node v with one child has an operation of the T 
form +,—,x,/, free associated to it. The operands of a 
this operation are constant input values, or among the os A 
ancestors of v in the tree. eS ba ~ 
a—c b=c) 
e Every node v with two children has associated to it a p pa 


branching of the form > 0, > 0, or = 0. The branch ie Fe e 
is with respect to the result of v’s parent node. If the = < = 
expression yields true, the computation continues with o 

the left child of v; otherwise, it continues with the right “ 
child of v. 


The term decision tree is used if all of the final results (leaves) are either true or 
false. If every branch is based on a linear function in the input values, we face a linear 
deciston tree. Analogously one can define, say, quadratic decision trees. 

The complexity of a computation or decision tree is the maximum number of vertices 
along any root-to-leaf path. It is well known that O(n log n) comparisons are required 
to sort n numbers. But also for some problems that appear easier than sorting at first 
glance, the same lower bound holds. Consider, for instance, the following problem. 


Element Uniqueness 
Input: {x1,..., Xn CR, neN. 


Output: Is x; = xj, for some i,j € (1,..., n} with i £j? 


10 


CG 2013 1.2. Basic Geometric Objects 


Ben-Or has shown that any algebraic decision tree to solve Element Uniqueness 
for n elements has complexity O(nlogn). 


1.2 Basic Geometric Objects 


We will mostly be concerned with the d-dimensional Euclidean space R4, for small 
d € N; typically, d = 2 or d = 3. The basic objects of interest in R4 are the following. 


Points. A point p, typically described by its d Cartesian TEET er= (7,1) 
coordinates p = (x1,...,Xa). E 
Directions. A vector v € 84+ (the (d — 1)-dimensional ge E 

unit sphere), typically described by its d Cartesian coor- : 


dinates v = (x1,...,xa), with lvl = \/) $, xi? =1. 


Lines. A line is a one-dimensional affine subspace. It can 
be described by two distinct points p and q as the set of 
all points r that satisfy r = p + A(q — p), for some A € R. q 


While any pair of distinct points defines a unique line, a line in R? contains infinitely 
many points and so it may happen that a collection of three or more points lie on a line. 
Such a collection of points is termed collinear] 


Rays. If we remove a single point from a line and take 
the closure of one of the connected components, then we NP 


obtain a ray. It can be described by two distinct points p 
and q as the set of all points r that satisfy r = p+A(q—p), q 
for some A > 0. The orientation of a ray is the direction p 


(q —p)/llq — pll- 


Line segment. A line segment is a compact connected sub- 

set of a line. It can be described by two points p and q q 
as the set of all points r that satisfy r = p+A(q—p), for 7 

some A € [0,1]. We will denote the line segment through 

p and q by pq. Depending on the context we may allow P 

or disallow degenerate line segments consisting of a single 

point only (p = q in the above equation). 


Hyperplanes. A hyperplane H is a (d—1)-dimensional affine subspace. It can be described 
algebraically by d + 1 coefficients A,,...,Aaii € R, where ||(Ai,...,Aq+1)|| = 1, as the 
set of all points (x1, ..., Xa) that satisfy the linear equation H : Te ÀiXi = Aga: 


3Not colinear, which refers to a notion in the theory of coalgebras. 
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If the above equation is converted into an inequality, we obtain the algebraic descrip- 
tion of a halfspace (in R?: halfplane). 


Spheres and balls. A sphere is the set of all points that are equidistant to a fixed point. 
It can be described by a point c (center) and a number p € R (radius) as the set of all 
points p that satisfy ||p —c|| =p. The ball of radius p around p consists of all points p 
that satisfy ||p — cl| < p. 


References 


[1] Michael Ben-Or, Lower bounds for algebraic computation trees. In Proc. 15th Annu. 


ACM Sympos. Theory Comput., pp. 80-86, 1983, URL http://dx.doi.org/10. 
1145/800061 .808735 
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Chapter 2 


Polygons 


Although we can think of a line £ C R? as an infinite point set that consists of all points 
in R? that are on £, there still exists a finite description for £. Such a description is, 
for instance, provided by the three coefficients a,b,c € R of an equation of the form 
ax + by = c, with (a,b) 4 (0,0). Actually this holds true for all of the fundamental 
geometric objects that were mentioned in the previous section: Each of them has constant 
description complexity (or, informally, just size), that is, it can be described by a 
constanti] number of parameters. 

In this course we will typically deal with objects that are not of constant size. Often 
these are formed by merely aggregating constant-size objects, for instance, points to 
form a finite set of points. But sometimes we also demand additional structure that 
goes beyond aggregation only. Probably the most fundamental geometric objects of this 
type are what we call polygons. You probably learned this term in school, but what 
is a polygon precisely? Consider the examples shown in Figure Are all of these 
polygons? If not, where would you draw the line? 


(a) 


Figure 2.1: What is a polygon? 


2.1 Classes of Polygons 


Obviously, there is not the right answer to such a question and certainly there are 
different types of polygons. Often the term polygon is used somewhat sloppily in place 


1Unless specified differently, we will always assume that the dimension is (a small) constant. In a 
high-dimensional space Rt, one has to account for a description complexity of O(d). 
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of what we call a simple polygon, defined below. 


Definition 2.1 A simple polygon is a compact region P C R? that is bounded by a simple 
closed curve y : [0,1] — R? that consists of a finite number of line segments. A 
curve is a continuous map y : [0,1] + R?. A curve y is closed, if y(0) = y(1) and tt 
is simple if it is injective on [0,1), that is, the curve does not intersect itself. 


Out of the examples shown above only Polygon [2.1aļis simple. For each of the remaining 
polygons it is impossible to combine the bounding segments into a simple closed curve. 

The term compact for subsets of R means bounded and closed. A subset of P C R4 
is bounded, if it is contained in the ball of radius r around the origin, for some finite 
r >0. Being closed means that the boundary is considered to be part of the polygon. 
In order to formally define these terms, let us briefly review a few basic notions from 
topology. 

The standard topology of Rĉ is defined in terms of the Euclidean metric. A point 
p € R4 is interior to a set P C R4, if there exists an e-ball B.(p) = {x ER : ||x—pl| < e) 
around p, for some e€ > 0, that is completely contained in P. A set is open, if all of its 
points are interior; and it is closed, if its complement is open. 


Exercise 2.2 Determine for each of the following sets whether they are open or closed 
in R?. a)B,(0) 6){(1,0)} c)R? d)R*\Z? e) R*\Q’ Ff) {(x,y): xe R,y > 0} 


Exercise 2.3 Show that the union of countably many open sets in Rt is open. Show 
that the union of a finite number of closed sets in IRY is closed. (These are two of 
the axioms that define a topology. So the statements are needed to assert that the 
metric topology is a topology, indeed.) What follows for intersections of open and 
closed sets? Finally, show that the union of countably many closed sets in Rt is 
not necessarily closed. 


The boundary OP of a set P C Rt consists of all points that are neither interior to P 
nor to its complement R4 \ P. By definition, for every p € OP every ball B.(p) contains 
both points from P and from R“\P. Sometimes one wants to consider a set P C R? open 
although it is not. In that case one can resort to the interior P° of P that is formed by 
the subset of points interior to P. Similarly, the closure P of P is defined by P = PUOP. 

Lower-dimensional objects, such as line segments in R? or triangles in R?, do not 
possess any interior point (because the e-balls needed around any such point are full- 
dimensional). Whenever we want to talk about the interior of a lower-dimensional object, 
we use the qualifier relatzve and consider it relative to the smallest affine subspace that 
contains the object. 

For instance, the smallest affine subspace that contains a line segment is a line and 
so the relative interior of a line segment in R? consists of all points except the endpoints, 
just like for an interval in Rt. Similarly, for a triangle in R? the smallest affine subspace 
that contains it is a plane. Hence its relative interior is just the interior of the triangle, 
considered as a two-dimensional object. 
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Exercise 2.4 Show that for any P C R4 the interior P° is open. (Why ws there some- 
thing to show to begin with?) Show that for any P C Rê the closure P is closed. 


When describing a simple polygon P it is sufficient to describe only its boundary 
OP. As OP by definition is a simple closed curve y that consists of finitely many line 
segments, we can efficiently describe it as a sequence pi,...,Pn of points, such that y 
is formed by the line segments P¡P2,P2P3,---,Pn-1Pn,PnP1- These points are referred 
to as the vertices of the polygon, and the segments connecting them are referred as the 
edges of the polygon. The set of vertices of a polygon P is denoted by V(P), and the 
set of edges of P is denoted by E(P). 

Knowing the boundary, it is easy to tell apart the (bounded) interior from the (un- 
bounded) exterior. This is asserted even for much more general curves by the well-known 
Jordan-Curve Theorem. 


Theorem 2.5 (Jordan 1887) Any simple closed curve y : [0,1] — R? divides the plane 
into exactly two connected components whose common boundary is formed by y. 


In full generality, the proof of the deceptively obvious claim is surprisingly difficult. We 
will not prove it here, the interested reader can find a proof, for instance, in the book 
of Mohar and Thomassen [11]. There exist different generalizations of the theorem and 
there also has been some debate about to which degree the original proof of Jordan is 
actually correct. For simple polygons the situation is easier, though. The essential idea 
can be worked out algorithmically, which we leave as an exercise. 


Exercise 2.6 Describe an algorithm to decide whether a point lies inside or outside 
of a simple polygon. More precisely, given a simple polygon P C R? as a list of its 
vertices (v;,v2,...,Vn) in counterclockwise order and a query point q € R?, decide 
whether q 1s inside P, on the boundary of P, or outside. The runtime of your 
algorithm should be O(n). 


There are good reasons to ask for the boundary of a polygon to form a simple curve: 
For instance, in the example depicted in Figure[2.1b]there are several regions for which it 
is completely unclear whether they should belong to the interior or to the exterior of the 
polygon. A similar problem arises for the interior regions in Figure But there are 
more general classes of polygons that some of the remaining examples fall into. We will 
discuss only one such class here. It comprises polygons like the one from Figure P.1d] 


Definition 2.7 A region P C R? is a simple polygon with holes if it can be described as 
P=F\Unes H°, where H is a finite collection of pairwise disjoint simple polygons 
(called holes) and F is a simple polygon for which F° D Uues H- 


The way this definition heavily depends on the notion of simple polygons makes it 
straightforward to derive a similar trichotomy as the Jordan Curve Theorem provides 
for simple polygons, that is, every point in the plane is either inside, or on the boundary, 
or outside of P (exactly one of these three). 
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2.2 Polygon Triangulation 


From a topological point of view, a simple polygon is nothing but a disk and so it is a very 
elementary object. But geometrically a simple polygon can be—as if mocking the label 
we attached to it—a pretty complicated shape, see Figure for an example. While 
there is an easy and compact one-dimensional representation in terms of the boundary, 
as a sequence of vertices/points, it is often desirable to work with a more structured 
representation of the whole two-dimensional shape. 


Figure 2.2: A simple (?) polygon. 


For instance, it is not straightforward to compute the area of a general simple polygon. 
In order to do so, one usually describes the polygon in terms of simpler geometric objects, 
for which computing the area is easy. Good candidates for such shapes are triangles, 
rectangles, and trapezoids. Indeed, it is not hard to show that every simple polygon 
admits a “nice” partition into triangles, which we call a triangulation. 


Definition 2.8 A triangulation of a simple polygon P is a collection J of triangles, such 
that 


(1) P= Utes le 
(2) V(P) = Urtez VIT); and 


(3) for every distinct pair T,U € J, the intersection TMU is either a common 
vertex, or a common edge, or empty. 


Exercise 2.9 Show that each condition in Definition[2.8| is necessary in the following 
sense: Give an ezample of a non-triangulation that would form a triangulation if 
the condition was omitted. Is the definition equivalent if (3) is replaced by T° NU’? = 
0, for every distinct pair T,U € T? 


If we are given a triangulation of a simple polygon P it is easy to compute the area of P 
by simply summing up the area of all triangles from J. Triangulations are an incredibly 
useful tool in planar geometry, and one reason for their importance is that every simple 
polygon admits one. 


Theorem 2.10 Every simple polygon has a triangulation. 
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Proof. Let P be a simple polygon on n vertices. We prove the statement by induction on 
n. For n = 3 we face a triangle P that is a triangulation by itself. For n > 3 consider the 
lexicographically smallest vertex v of P, that is, among all vertices of P with a smallest x- 
coordinate the one with smallest y-coordinate. Denote the neighbors of v (next vertices) 
along OP by u and w. Consider the line segment uw. We distinguish two cases. 

Case 1: except for its endpoints u and w, the segment UW lies completely in P*. 
Then tw splits P into two smaller polygons, the triangle uvw and a simple polygon P’ 
on n—1 vertices (Figure [2.3a). By the inductive hypothesis, P’ has a triangulation that 
together with T yields a triangulation of P. 


u u 
v v 
w w 


(a) Case 1. (b) Case 2. 


Figure 2.3: Cases in the proof of Theorem 


Case 2: the relative interior of uw does not lie completely in P° (Figure [2.3b). By 
choice of v, the polygon P is contained in the closed halfplane to the right of the vertical 
line through v. Therefore, as the segments UV and VW are part of a simple closed curve 
defining OP, every point sufficiently close to v and between the rays vu and vw must be 
in P*. 

On the other hand, since Uw ¢ P”, there is some point from OP in the interior of 
the triangle T = uvw (by the choice of v the points u, v, w are not collinear and so T is 
a triangle, indeed) or on the line segment uw. In particular, as OP is composed of line 
segments, there is a vertex of P in T° or on Uw (otherwise, a line segment would have 
to intersect the line segment Uw twice, which is impossible). Let p denote a leftmost 
such vertex. Then the open line segment Vp is contained in T° and, thus, it splits P into 
two polygons P, and P, on less than n vertices each (in one of them, u does not appear 
as a vertex, whereas w does not appear as a vertex in the other). By the inductive 
hypothesis, both Pı and P2 have triangulations and their union yields a triangulation of 
P. O 


The configuration from Case 1 above is called an ear: Three consecutive vertices u,v, w 
of a simple polygon P such that the relative interior of uw lies in P°. In fact, we could 
have skipped the analysis for Case 2 by referring to the following theorem. 


Theorem 2.11 (Meisters [9] [10]) Every simple polygon that is not a triangle has two 
non-overlapping ears, that is, two ears A and B such that A* N B° = (). 
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But knowing Theorem we can obtain Theorem as a direct consequence of the 
following 


Theorem 2.12 Every triangulation of a simple polygon on n > 4 vertices contains at 
least two (triangles that are) ears. 


Exercise 2.13 Prove Theorem |2.12 


Exercise 2.14 Let P be a simple polygon with vertices vi, V2,...,Vn (tn counterclock- 
wise order), where vi has coordinates (xi, yi). Show that the area of P is 


1 n 
3 dE Xi41Ui — XiYi+1, 


i=1 


where (Xn+1, Yn+1) = (X1, Y1). 


The number of edges and triangles in a triangulation of a simple polygon are completely 
determined by the number of vertices, as the following simple lemma shows. 


Lemma 2.15 Every triangulation of a simple polygon on n > 3 vertices consists of 
n —2 triangles and 2n — 3 edges. 


Proof. Proof by induction on n. The statement is true for n = 3. For n > 3 consider 
a simple polygon P on n vertices and an arbitrary triangulation T of P. Any edge uv in 
T that is not an edge of P (and there must be such an edge because P is not a triangle) 
partitions P into two polygons P, and Pa with n; and no vertices, respectively. Since 
n¡,n2 < n we conclude by the inductive hypothesis that T partitions P, into n; — 2 
triangles and Pa into nz — 2 triangles, using 2n; — 3 and 2n2 — 3 edges, respectively. 
All vertices of P appear in exactly one of P; or P2, except for u and v, which appear in 
both. Therefore nı +n = n+2 and so the number of triangles in T is (nı—2)+(n2—2) = 
(nı +n.) -4=n+2—4=n-—2. Similarly, all edges of T appear in exactly one of P; 
or Pz, except for the edge uv, which appears in both. Therefore the number of edges in 
T is (2n; — 3) + (2na — 3) —1= 2(n, +n.) -7=2(n+2)-7=2n-3. O 


The universal presence of triangulations is something particular about the plane: The 
natural generalization of Theorem to dimension three and higher does not hold. 
What is this generalization, anyway? 


Tetrahedralizations in R3. A simple polygon is a planar object that is a topological disk 
that is locally bounded by patches of lines. The corresponding term in IR? is a polyhedron, 
and although we will not formally define it here yet, a literal translation of the previous 
sentence yields an object that topologically is a ball and is locally bounded by patches 
of planes. A triangle in R? corresponds to a tetrahedron in R? and a tetrahedralization 
is a nice partition into tetrahedra, where “nice” means that the union of the tetrahedra 
covers the object, the vertices of the tetrahedra are vertices of the polyhedron, and any 
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two distinct tetrahedra intersect in either a common triangular face, or a common edge, 
or a common vertex, or not at all? 

Unfortunately, there are polyhedra in R that do not admit a tetrahedralization. The 
following construction is due to Schónhardt [12]. It is based on a triangular prism, that 
is, two congruent triangles placed in parallel planes where the corresponding sides of both 
triangles are connected by a rectangle (Figure|2.4a)). Then one triangle is twisted/rotated 
slightly within its plane. As a consequence, the rectangular faces are not plane anymore, 
but they obtain an inward dent along their diagonal in direction of the rotation (Fig- 
ure |2.4b). The other (former) diagonals of the rectangular faces—labeled ab’, bc’, and 


(a) (b) 


Figure 2.4: The Schonhardt polyhedron cannot be subdivided into tetrahedra without 
adding new vertices. 


ca’ in Figure [2.4b}—are now epigonals, that is, they lie in the exterior of the polyhe- 
dron. Since these epigonals are the only edges between vertices that are not part of 
the polyhedron, there is no way to add edges to form a tetrahedron for a subdivision. 
Clearly the polyhedron is not a tetrahedron by itself, and so we conclude that it does 
not admit a subdivision into tetrahedra without adding new vertices. If adding new 
vertices—so-called Steiner vertices—is allowed, then there is no problem to construct a 
tetrahedralization, and this holds true in general. 


Algorithms. Knowing that a triangulation exists is nice, but it is much better to know 
that it can also be constructed efficiently. 


Exercise 2.16 Convert Theorem into an O(n?) time algorithm to construct a 
triangulation for a given simple polygon on n vertices. 


The runtime achieved by the straightforward application of Theorem is not optimal. 
We will revisit this question at several times during this course and discuss improved 
algorithms for the problem of triangulating a simple polygon. 


2These “nice” subdivisions can be defined in an abstract combinatorial setting, where they are called 
simplicial complices. 
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The best (in terms of worst-case runtime) algorithm known due to Chazelle [4] com- 
putes a triangulation in linear time. But this algorithm is very complicated and we will 
not discuss it here. There is also a somewhat simpler randomized algorithm to compute 
a triangulation in expected linear time [2], which we will not discuss in detail, either. 
Instead you will later see a much simpler algorithm with a pretty-close-to linear runtime 
bound. The question of whether there exists a simple (which is not really a well-defined 
term, of course, except that Chazelle’s Algorithm does not qualify) deterministic linear 
time algorithm to triangulate a simple polygon remains open [7]. 


Polygons with holes. It is interesting to note that the complexity of the problem changes 
to O(nlog n), if the polygon may contain holes [3]. This means that there is an algorithm 
to construct a triangulation for a given simple polygon with holes on a total of n vertices 
(counting both the vertices on the outer boundary and those of holes) in O(nlogn) 
time. But there is also a lower bound of O(nlogn) operations that holds in all models 
of computation in which there exists the corresponding lower bound for comparison- 
based sorting. This difference in complexity is a very common pattern: There are many 
problems that are (sometimes much) harder for simple polygons with holes than for 
simple polygons. So maybe the term “simple” has some justification, after all... 


Genaral triangle covers. What if we drop the “niceness” conditions required for triangu- 
lations and just want to describe a given simple polygon as a union of triangles? It 
turns out this is a rather drastic change and, for instance, it is unlikely that we can 
efficiently find an optimal/minimal description of this type: Christ has shown [5] that it 
is NP-hard to decide whether for a simple polygon P on n vertices and a positive integer 
k, there exists a set of at most k triangles whose union is P. In fact, the problem is not 
even known to be in NP, because it is not clear whether the coordinates of solutions can 
always be encoded compactly. 


2.3 The Art Gallery Problem 


In 1973 Victor Klee posed the following question: “How many guards are necessary, and 
how many are sufficient to patrol the paintings and works of art in an art gallery with n 
walls?” From a geometric point of view, we may think of an “art gallery with n walls” as 
a simple polygon bounded by n edges, that is, a simple polygon P with n vertices. And 
a guard can be modeled as a point where we imagine the guard to stand and observe 
everything that is in sight. In sight, finally, refers to the walls of the gallery (edges of 
the polygon) that are opaque and, thus, prevent a guard to see what is behind. In other 
words, a guard (point) g can watch over every point p € P, for which the line segment 
gp lies completely in P°, see Figure 
It is not hard to see that |n/3| guards are necessary in general. 


20 


CG 2013 2.3. The Art Gallery Problem 


Figure 2.5: The region that a guard g can observe. 


Exercise 2.17 Describe a family (P.)n>3 of simple polygons such that Pa has n vertices 
and at least |n/3| guards are needed to guard it. 


What is more surprising: |n/3| guards are always sufficient as well. Chvátal [6] was 
the first to prove that, but then Fisk gave a much simpler proof using—you may 
have guessed it—triangulations. Fisk’s proof was considered so beautiful that it was 
included into “Proofs from THE BOOK” [1], a collection inspired by Paul Erdós' belief 
in “a place where God keeps aesthetically perfect proofs”. The proof is based on the 
following lemma. 


Lemma 2.18 Every triangulation of a simple polygon is 3-colorable. That is, each 
verter can be assigned one of three colors in such a way that adjacent vertices 
receive different colors. 


Proof. Induction on n. For n = 3 the statement is obvious. For n > 3, by Theorem [2.12] 
the triangulation contains an ear uvw. Cutting off the ear creates a triangulation of a 
polygon on n — 1 vertices, which by the inductive hypothesis admits a 3-coloring. Now 
whichever two colors the vertices u and w receive in this coloring, there remains a third 
color to be used for v. O 


Theorem 2.19 (Fisk [8]) Every simple polygon on n vertices can be guarded using at 
most |n/3| guards. 


Proof. Consider a triangulation of the polygon and a 3-coloring of the vertices as ensured 
by Lemma Take the smallest color class, which clearly consists of at most |n/3| 
vertices, and put a guard at each vertex. As every point of the polygon is contained in 
at least one triangle and every triangle has exactly one vertex in the guarding set, the 
whole polygon is guarded. O 


Questions 


1. What is a simple polygon/a simple polygon with holes Explain the definitions 
and provide some examples of members and non-members of the respective classes. 
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Figure 2.6: A triangulation of a simple polygon on 17 vertices and a 3-coloring of tt. 
The vertices shown solid orange form the smallest color class and guard 
the polygon using |17/3| =5 guards. 


For a given polygon you should be able to tell which of these classes it belongs to 
or does not belong to and argue why this is the case. 


2. What is a closed/open/bounded set in Rt? What is the interior/closure of a 
point set? Explain the definitions and provide some illustrative examples. For a 
given set you should be able to argue which of the properties mentioned it possesses. 


3. What is a triangulation of a simple polygon? Does it always exist? Explain 
the definition and provide some illustrative examples. Present the proof of Theo- 


rem in detail. 


4. How about higher dimensional generalizations? Can every polyhedron in R? 
be nicely subdivided into tetrahedra? Explain Schonhardt’s construction. 


5. How many points are needed to guard a simple polygon? Present the proofs of 


Theorem Lemma 2.18) and Theorem in detail. 
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Chapter 3 


Convex Hull 


There exists an incredible variety of point sets and polygons. Among them, some have 
certain properties that make them “nicer” than others in some respect. For instance, 
look at the two polygons shown below. 


(a) A convex polygon. (b) A non-convex polygon. 


Figure 3.1: Ezamples of polygons: Which do you like better? 


As it is hard to argue about aesthetics, let us take a more algorithmic stance. When 
designing algorithms, the polygon shown on the left appears much easier to deal with 
than the visually and geometrically more complex polygon shown on the right. One 
particular property that makes the left polygon nice is that one can walk between any 
two vertices along a straight line without ever leaving the polygon. In fact, this statement 
holds true not only for vertices but for any two points within the polygon. A polygon 
or, more generally, a set with this property is called convez. 


Definition 3.1 A set P C Rê is convex if pq C P, for any p,q €P. 


An alternative, equivalent way to phrase convexity would be to demand that for every 
line £ C Rê the intersection LNP be connected. The polygon shown in Figure is not 
convex because there are some pairs of points for which the connecting line segment is not 
completely contained within the polygon. An immediate consequence of the definition 
is the following 
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Observation 3.2 For any family (Pi)icı of convex sets, the intersection [licr Pi 1s con- 
vex. 


Indeed there are many problems that are comparatively easy to solve for convex sets 
but very hard in general. We will encounter some particular instances of this phenomenon 
later in the course. However, not all polygons are convex and a discrete set of points is 
never convex, unless it consists of at most one point only. In such a case it is useful to 
make a given set P convex, that is, approximate P with or, rather, encompass P within 
a convex set H D P. Ideally, H differs from P as little as possible, that is, we want H to 
be a smallest convex set enclosing P. 

At this point let us step back for a second and ask ourselves whether this wish makes 
sense at all: Does such a set H (always) exist? Fortunately, we are on the safe side 
because the whole space R® is certainly convex. It is less obvious, but we will see below 
that H is actually unique. Therefore it is legitimate to refer to H as the smallest convex 
set enclosing P or—shortly—the convez hull of P. 


3.1 Convexity 


In this section we will derive an algebraic characterization of convexity. Such a charac- 
terization allows to investigate convexity using the machinery from linear algebra. 
Consider P C Rt. From linear algebra courses you should know that the linear hull 


lin(P) ={q |a =) Api AVi: pi eP, NER) 


is the set of all linear combinations of P (smallest linear subspace containing P). For 
instance, if P = {p} C R? \ {0} then lin(P) is the line through p and the origin. 
Similarly, the affine hull 


aff(P) := {q [=D Ap AD A=1AVi: pePRAER} 


is the set of all affine combinations of P (smallest affine subspace containing P). For 
instance, if P = {p,q} C R? and p  q then aff(P) is the line through p and q. 

It turns out that convexity can be described in a very similar way algebraically, which 
leads to the notion of convex combinations. 


Proposition 3.3 A set P C Rê is conver if and only if > i Aipi € P, for alln e N, 
Pinus pa EP, and M,... An 2 0 with} Aga 


Proof. “<=”: obvious with n = 2. 

“>”. Induction on n. For n = 1 the statement is trivial. For n > 2, let p; € P 
and A; > 0, for 1 < i < n, and assume ) į; à` = 1. We may suppose that A; > 0, 
for all i. (Simply omit those points whose coefficient is zero.) We need to show that 
2i Mpi EP. 
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Define A = o A, and for 1 <i < n—1 set u = A¡/A. Observe that u; > 0 
and > u = 1. By the inductive hypothesis, q := Ni upi € P, and thus by 
convexity of P also Aq + (1 — A)p, € P. We conclude by noting that Aq + (1 — A)pn = 
A o Hipi +AnPn = Mi Nipi. O 


Definition 3.4 The convex hull conv(P) of a set P C R® is the intersection of all convex 
supersets of P. 


At first glance this definition is a bit scary: There may be a whole lot of supersets for 
any given P and it not clear that taking the intersection of all of them yields something 
sensible to work with. However, by Observation we know that the resulting set 
is convex, at least. The missing bit is provided by the following proposition, which 
characterizes the convex hull in terms of exactly those convex combinations that appeared 
in Proposition already. 


Proposition 3.5 For any P CR“ we have 


conv(P) = (E MPi 


i=1 


neNA Y) AH=1 AWE ll... ME: nzorncert. 


i=1 


The elements of the set on the right hand side are referred to as conver combinations 
of P. 

Proof. “>”: Consider a convex set C D P. By Proposition [3.3] (only-if direction) the 
right hand side is contained in C. As C was arbitrary, the claim follows. 

“C”: Denote the set on the right hand side by R. Clearly R > P. We show that R 
forms a convex set. Let p = )_;_, Aipi and q = )_;|_, ppi be two convex combinations. 
(We may suppose that both p and q are expressed over the same p; by possibly adding 
some terms with a coefficient of zero.) 

Then for A € [0,1] we have Ap + (1—A)q = } ,(M2%; + (1 —A)pi)pi € R, as 
Mi +(1-A) wi SO, forall1<i<n, and» ,(MM+(1—Au) =A+(1-1=1. 0 
pai a 


20 >0 >0 


In linear algebra the notion of a basis in a vector space plays a fundamental role. In 
a similar way we want to describe convex sets using as few entities as possible, which 
leads to the notion of extremal points, as defined below. 


Definition 3.6 The convex hull of a finite point set P C RÌ forms a convex polytope. 
Each p € P for which p E conv(P \ {p}) zs called a vertex of conv(P). A vertex of 
conv(P) zs also called an extremal point of P. A convex polytope in R? is called a 
convex polygon. 


Essentially, the following proposition shows that the term vertex above is well defined. 


Proposition 3.7 A convex polytope in RÌ is the convex hull of its vertices. 
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Proof. Let P = ([p1,...,pn), n € N, such that without loss of generality pi,...,px 
are the vertices of P := conv(P). We prove by induction on n that conv(p1,...,pn) € 
conv(pi,...,Ppx). For n = k the statement is trivial. 

For n > k, p, is not a vertex of P and hence p, can be expressed as a convex 
combination pn = >)", Mpi. Thus for any x € P we can write x = } |; Hipi = 
A pin Dy pi = Dy (Mit undAdpi- As Dy (14 + Un Ai) = 1, we conclude 
inductively that x € conv(p1,..-,Pn—1) E conv(pi,..., Px). O 


3.2 Classical Theorems for Convex Sets 


Next we will discuss a few fundamental theorems about convex sets in R. The proofs 
typically use the algebraic characterization of convexity and then employ some techniques 
from linear algebra. 


Theorem 3.8 (Radon [8]) Any set P CRY of d+2 points can be partitioned into two 
disjoint subsets Pı and P2 such that conv(P,) Mconv(P2) Æ 0. 


Proof. Let P = {pi,...,Pa;2}. No more than d+ 1 points can be affinely independent 
in Rt. Hence suppose without loss of generality that pa,2 can be expressed as an affine 
combination of pi,...,Pa4i, that is, there exist A1,...,Aqa+1 E R with = Ai = 1 
and ais MiPi = pa+2. Let Pı be the set of all points p; for which A; is positive and 
let Pa = P \ Pi. Then setting Ay = —1 we can write Deis Mpi = Pets —MPi, 
where all coefficients on both sides are non-negative. Renormalizing by u = Ai/u and 
vi = Ai/v, where u = 2 peP: A, and v = —} peP Ai, yields convex combinations 
Dee, Hipi = pers vipi that describe a common point of conv(P,) and conv(P2). O 


Theorem 3.9 (Helly) Consider a collection € =([C1,..., Cn} of n > d+1 conver subsets 
of Rt, such that any d+1 pairwise distinct sets from © have non-empty intersection. 
Then also the intersection ();_, Ci of all sets from C is non-empty. 


Proof. Induction on n. The base case n = d+ 1 holds by assumption. Hence suppose 
that n > d+ 2. Consider the sets Di = Dir C;, for i € {1,...,n} As D; is an 
intersection of n — 1 sets from C, by the inductive hypothesis we know that D; 4 Ó. 
Therefore we can find some point p; € Di, for each i € {1,...,n}. Now by Theorem [3.8] 
the set P = {pi,..., Pn} can be partitioned into two disjoint subsets P, and P, such that 
conv(P,) Mconv(P2) 4 Ø. We claim that any point p € conv(P,) M conv(P,) also lies in 
Mi: Ci, which completes the proof. 

Consider some Cj, for i € {1,...,n}. By construction D; C Cj, for j #i. Thus pi 
is the only point from P that may not be in C;. As p; is part of only one of P, or Pa, 
say, of P4, we have Pa C Ci. The convexity of Ci implies conv(P,) C C; and, therefore, 
pe Ci. O 
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Theorem 3.10 (Carathéodory [3]) For any P C R? and q € conv(P) there exist k < d+1 
points p1,..., px € P such that q € conv(p1,...,Ppx). 


Exercise 3.11 Prove Theoreml3.10 


Theorem 3.12 (Separation Theorem) Any two compact conver sets C,D C RÌ with C N 
D = Ý cam be separated strictly by a hyperplane, that is, there exists a hyperplane 
h such that C and D le in the opposite open halfspaces bounded by h. 


Proof. Consider the distance function $: C x D > R with (c, d) > |lc— dll. Since C x D 
is compact and ô is continuous and strictly bounded from below by 0, the function ô 
attains its minimum at some point (cy, do) € C x D with (co, do) > O. Let h be the 
hyperplane perpendicular to the line segment codo and passing through the midpoint of 
Co and do. 

If there was a point, say, c’ in CN h, then by 
convexity of C the whole line segment coc’ lies in 
C and some point along this segment is closer to 
do than is Cy, in contradiction to the choice of co. 
The figure shown to the right depicts the situation 
in R?. If, say, C has points on both sides of h, then 
by convexity of C it has also a point on h, but we 
just saw that there is no such point. Therefore, C 
and D must lie in different open halfspaces bounded 
by h. O 


The statement above is wrong for arbitrary (not necessarily compact) convex sets. How- 
ever, if the separation is not required to be strict (the hyperplane may intersect the sets), 
then such a separation always exists, with the proof being a bit more involved (cf. [7], 
but also check the errata on Matousek’s webpage). 


Exercise 3.13 Show that the Separation Theorem does not hold in general, if not both 
of the sets are conver. 


Exercise 3.14 Prove or disprove: 
(a) The conver hull of a compact subset of RÌ is compact. 
(b) The convex hull of a closed subset of RÌ is closed. 


Altogether we obtain various equivalent definitions for the convex hull, summarized 
in the following theorem. 


Theorem 3.15 For a compact set P C RÌ we can characterize conv(P) equivalently as 
one of 


(a) the smallest (w. r. t. set inclusion) conver subset of RÌ that contains P; 
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(b) the set of all conver combinations of points from P; 
(c) the set of all conver combinations formed by d +1 or fewer points from P; 
(d) the intersection of all conver supersets of P; 


(e) the intersection of all closed halfspaces containing P. 


Exercise 3.16 Prove Theorem 


3.3 Planar Convex Hull 


Although we know by now what is the convex hull of point set, it is not yet clear how 
to construct it algorithmically. As a first step, we have to find a suitable representation 
for convex hulls. In this section we focus on the problem in R?, where the convex hull 
of a finite point set forms a convex polygon. A convex polygon is easy to represent, 
for instance, as a sequence of its vertices in counterclockwise orientation. In higher 
dimensions finding a suitable representation for convex polytopes is a much more delicate 
task. 


Problem 3.17 (Convex hull) 
Input: P = {pi,...,Ppn} C R?, ne N. 


Output: Sequence (qi,...,qn), 1 < h < n, of the vertices of conv(P) (ordered counter- 
clockwise). 


e ° e e 
o e e 
e o° e e y b e M 
e 
e e °? °’ ` eos, pr pas 
e > o eee 
e . ° bd ee . 
Pa è ` e o e o 
e e e .. ° 
e ê ° è 
e s e e "i 
e 
(a) Input. (b) Output. 


Figure 3.2: Convex Hull of a set of points in R?. 


Another possible algorithmic formulation of the problem is to ignore the structure of the 
convex hull and just consider it as a point set. 
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Problem 3.18 (Extremal points) 
Input: P = {pi,..., Pn} CR? ne N. 


Output: Set Q C P of the vertices of conv(P). 


Degeneracies. A couple of further clarifications regarding the above problem definitions 
are in order. 

First of all, for efficiency reasons an input is usually specified as a sequence of points. 
Do we insist that this sequence forms a set or are duplications of points allowed? 

What if three points are collinear? Are all of them considered extremal? According 
to our definition from above, they are not and that is what we will stick to. But note 
that there may be cases where one wants to include all such points, nevertheless. 

By the Separation Theorem, every extremal point p can be separated from the convex 
hull of the remaining points by a halfplane. If we take such a halfplane and translate its 
defining line such that it passes through p, then all points from P other than p should lie 
in the resulting open halfplane. In R? it turns out convenient to work with the following 
“directed” reformulation. 


Proposition 3.19 A point p € P = {pi,...,Pn} C R? is extremal for P <= there is a 
directed line g through p such that PX {p} is to the left of g. 


The zntertor angle at a vertex v of a polygon P is the angle 
between the two edges of P incident to v whose corresponding E r 
angular domain lies in P°. If this angle is smaller than 7, the 
vertex is called convez; if the angle is larger than 71, the vertex is 
called reflex. For instance, the vertex c in the polygon depicted 
to the right is a convex vertex, whereas the vertex labeled r is 
a reflex vertex. 


Exercise 3.20 


A set S C R? is star-shaped 1f there exists a point c € S, 
such that for every point p € S the line segment Tp is 
contained in S. A simple polygon with exactly three convex 
vertices is called a pseudotriangle (see the erample shown 
on the right). 


In the following we consider subsets of R?. Prove or disprove: 
a) Every convex verter of a simple polygon lies on its conver hull. 
b) Every star-shaped set is convez. 
c) Every convez set 1s star-shaped. 


d) The intersection of two conver sets is conver. 
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e) The union of two conver sets is conver. 

f) The intersection of two star-shaped sets is star-shaped. 

g) The intersection of a conver set with a star-shaped set is star-shaped. 
h) Every triangle is a pseudotriangle. 


1) Every pseudotriangle is star-shaped. 


3.4 Trivial algorithms 


One can compute the extremal points using Carathéodory’s Theorem as follows: Test 
for every point p € P whether there are q,r,s € P \{p} such that p is inside the triangle 
with vertices q, r, and s. Runtime O(n‘). 

Another option, inspired by the Separation Theorem: test for every pair (p,q) € P? 
whether all points from P \ {p, q} are to the left of the directed line through p and q (or 
on the line segment pq). Runtime O(n’). 


Exercise 3.21 Let P = (po,...,Pn—1) be a sequence of n points in ¡R?. Someone claims 
that you can check by means of the following algorithm whether or not P describes 
the boundary of a convex polygon in counterclockwise order: 


bool 1s convet(Po,...,Pn-1) L 
yor A O y 1: 
af (Pi, Pli+1)modn, P(i+2)moan) form a rightturn: 
return false; 
return true; 


1 


Disprove the claim and describe a correct algorithm to solve the problem. 


Exercise 3.22 Let P C R? be a convex polygon, given as an array p[0]...p[n-1] of its 
n vertices in counterclockwise order. 


a) Describe an O(log(n)) time algorithm to determine whether a point q lies 
inside, outside or on the boundary of P. 


b) Describe an O(log(n)) time algorithm to find a (right) tangent to P from a 
query point q located outside P. That is, find a verter pli], such that P is 


contained in the closed halfplane to the left of the oriented line qpli] 
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3.5 Jarvis’ Wrap 


We are now ready to describe a first simple algorithm to construct the convex hull. It 
works as follows: 


Find a point pı that is a vertex of conv(P) (e.g., the one with smallest x- 
coordinate). “Wrap” P starting from pj, i.e., always find the next vertex 
of conv(P) as the one that is rightmost with respect to the direction given 
by the previous two vertices. 


Besides comparing x-coordinates, the only geometric primitive needed is an orienta- 
tion test: Denote by rightturn(p, q,r), for three points p, q,r € R?, the predicate that 
is true if and only if r is (strictly) to the right of the oriented line pq. 


e next 
. A 
q[0]=p-start i . 


q [1] 


Code for Jarvis’ Wrap. 


p[O..N) contains a sequence of N points. 
p_start point with smallest x-coordinate. 
q_next some other point in p[0..N). 


int h = 0; 
Point_2 q_now = p_start; 
do { 
qlh] = q_now; 
h=h+1; 


for (int i= 0; i < N; i= i + 1) 
if (rightturn_2(q_now, q_next, pli])) 
q_next = plil; 
q_now = q_next; 
q_next = p_start; 
} while (q_now != p_start); 


q[0,h) describes a convex polygon bounding the convex hull of p[0. .N). 
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Analysis. For every output point the above algorithm spends n rightturn tests, which is 
= O(nh) in total. 


Theorem 3.23 [6] Jarvis? Wrap computes the conver hull of n points in R? using 
O(nh) rightturn tests, where h ts the number of hull vertices. 


In the worst case we have h = n, that is, O(n”) rightturn tests. Jarvis’ Wrap has a 
remarkable property that is called output sensitivity: the runtime depends not only on 
the size of the input but also on the size of the output. For a huge point set it constructs 
the convex hull in optimal linear time, if the convex hull consists of a constant number of 
vertices only. Unfortunately the worst case performance of Jarvis’ Wrap is suboptimal, 
as we will see soon. 


Degeneracies. The algorithm may have to cope with various degeneracies. 


e Several points have smallest x-coordinate = lexicographic order: 
(Px: Py) < (qx, qy) <> Px < qx V Px = qx A Py < qy . 


e Three or more points collinear = choose the point that is farthest among those 
that are rightmost. 


Predicates. Besides the lexicographic comparison mentioned above, the Jarvis’ Wrap 
(and most other 2D convex hull algorithms for that matter) need one more geomet- 
ric predicate: the rightturn or—more generally—orientation test. The computation 
amounts to evaluating a polynomial of degree two, see the exercise below. We therefore 
say that the orientation test has algebraic degree two. In contrast, the lexicographic 
comparison has degree one only. The algebraic degree not only has a direct impact on 
the efficiency of a geometric algorithm (lower degree + less multiplications), but also an 
indirect one because high degree predicates may create large intermediate results, which 
may lead to overflows and are much more costly to compute with exactly. 


Exercise 3.24 Prove that for three points (px, Py), (qx, dy), (Tx, Ty) € R?, the sign of 
the determinant 


1 Px Py 
1 qx qy 
1 T% Ty 


determines if r lies to the right, to the left or on the directed line through p and q. 
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3.6 Graham Scan (Successive Local Repair) 


There exist many algorithms that exhibit a better worst-case runtime than Jarvis’ Wrap. 
Here we discuss only one of them: a particularly elegant and easy-to-implement variant 
of the so-called Graham Scan [5]. This algorithm is referred to as Successive Local 
Repair because it starts with some polygon enclosing all points and then step-by-step 
repairs the deficiencies of this polygon, by removing non-convex vertices. It goes as 
follows: 

Sort points lexicographically and remove duplicates: (pi,...,Pn). 


P3 P5 


P1 


Po P4 P1 P3 P2 P5 Ps P7 Pe P7 Ps Ps P2 P3 Pi P4 Po 


As long as there is a (consecutive) triple (p, q,r) such that r is to the right of or on the 
directed line pd , remove q from the sequence. 


Code for Graham Scan. 


plO..N) lexicographically sorted sequence of pairwise distinct points, N > 2. 


qlO] = plo]; 
int h = QO; 
// Lower convex hull (left to right): 
for (int i = 1; i <N; i=i+1){ 
while (h>0 && !leftturn_2(q[h-1], q[h], plil)) 
h=h - 1; 
h=h+ 1; 
q[h] = p[i]; 
} 


// Upper convex hull (right to left): 
for (int i = N-2; i >= 0; i=i- 1) 4 
while (!leftturn_2(q[h-1], q[h], p[i])) 
h=h - 1; 
h=h+ 1; 
qth] = plil; 


q[0,h) describes a convex polygon bounding the convex hull of p[0. .N). 
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Analysis. 


Theorem 3.25 The convez hull of a set P C R? of n points can be computed using 
O(nlogn) geometric operations. 


Proof. 
1. Sorting and removal of duplicate points: O(nlogn). 


2. At the beginning we have a sequence of 2n — 1 points; at the end the sequence 
consists of h points. Observe that for every positive orientation test, one point is 
discarded from the sequence for good. Therefore, we have exactly 2n — h — 1 such 
shortcuts/positive orientation tests. In addition there are at most 2n — 2 negative 
tests (#iterations of the outer for loops). Altogether we have at most 4n — h— 3 
orientation tests. 


In total the algorithm uses O(nlogn) geometric operations. Note that the number of 
orientation tests is linear only, but O(n logn) lexicographic comparisons are needed. O 


3.7 Lower Bound 


It is not hard to see that the runtime of Graham Scan is asymptotically optimal in the 
worst-case. 


Theorem 3.26 O(nlogn) geometric operations are needed to construct the convex hull 
of n points in R? (in the algebraic computation tree model). 


Proof. Reduction from sorting (for which it is known that O(nlogn) comparisons 
are needed in the algebraic computation tree model). Given n real numbers X1,...,Xn, 
construct a set P = {pi |1 <i < n} of n points in R? by setting pi = (xi,x?). This 
construction can be regarded as embedding the numbers into R? along the x-axis and 
then projecting the resulting points vertically onto the unit parabola. The order in which 
the points appear along the lower convex hull of P corresponds to the sorted order of 
the xi. Therefore, if we could construct the convex hull in o(n log n) time, we could also 
sort in o(nlog n) time. O 


Clearly this reduction does not work for the Extremal Points problem. But us- 
ing a reduction from Element Uniqueness (see Section instead, one can show that 
O(nlog n) is also a lower bound for the number of operations needed to compute the set 
of extremal points only. This was first shown by Avis [i] for linear computation trees, 
then by Yao [9] for quadratic computation trees, and finally by Ben-Or for general 
algebraic computation trees. 
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3.8 Chan's Algorithm 


Given matching upper and lower bounds we may be tempted to consider the algorithmic 
complexity of the planar convex hull problem settled. However, this is not really the 
case: Recall that the lower bound is a worst case bound. For instance, the Jarvis’ Wrap 
runs in O(nh) time an thus beats the O(nlog n) bound in case that h = o(logn). The 
question remains whether one can achieve both output dependence and optimal worst 
case performance at the same time. Indeed, Chan [4] presented an algorithm to achieve 
this runtime by cleverly combining the “best of” Jarvis’ Wrap and Graham Scan. Let us 
look at this algorithm in detail. The algorithm consists of two steps that are executed 
one after another. 


Divide. Input: a set P C R? of n points and a number H € {1,... n}. 
1. Divide P into k = [n/H] sets P,,...,P, with |Pi| < H. 
2. Construct conv(P;) for alli, 1 <i<k. 


Analysis. Step 1 takes O(n) time. Step 2 can be handled using Graham Scan in 
O(Hlog H) time for any single P;, that is, O(n log H) time in total. 


Conquer. Output: the vertices of conv(P) in counterclockwise order, if conv(P) has less 
than H vertices; otherwise, the message that conv(P) has at least H vertices. 


1. Find the lexicographically smallest point in conv(P;) for alli, 1<1i<k. 


2. Starting from the lexicographically smallest point of P find the first H points of 
conv(P) oriented counterclockwise (simultaneous Jarvis’ Wrap on the sequences 
conv(P;)). 


Determine in every wrap step the point q; of tan- 
gency from the current point of conv(P) to conv(P;), 
for all 1 < i < k. We have seen in Exercise [3.22] how 
to compute qi in O(log |conv(P;)|) = O(log H) time. 
Among the k candidates qi,...,qx we find the next 
vertex of conv(P) in O(k) time. 

Analysis. Step 1 takes O(n) time. Step 2 con- 
sists of at most H wrap steps. Each wrap step needs 
O(klog H + k) = O(klog H) time, which amounts to 
O(Hklog H) = O(n log H) time for Step 2 in total. 

Remark. Using a more clever search strategy instead of many tangency searches one 
can handle the conquer phase in O(n) time, see Exercise below. However, this is 
irrelevant as far as the asymptotic runtime is concerned, given that already the divide 
step takes O(n log H) time. 
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Exercise 3.27 Consider k convez polygons P1,...Px, for some constant k € N, where 
each polygon is given as a last of its vertices in counterclockwise omentation. Show 
how to construct the convex hull of P,U...UP, in O(n) time, where n = ar Ni 
and n; ts the number of vertices of Pi, for1<i<k. 


Searching forh. While the runtime bound for H = his exactly what we were heading for, 
it looks like in order to actually run the algorithm we would have to know h, which— 
in general—we do not. Fortunately we can circumvent this problem rather easily, by 
applying what is called a doubly exponential search. It works as follows. 

Call the algorithm from above iteratively with parameter H = min{2?',n}, for t = 
0,..., until the conquer step finds all extremal points of P (i.e., the wrap returns to its 
starting point). 

Analysis: Let 22 be the last parameter for which the algorithm is called. Since the 
previous call with H = 2% * did not find all extremal points, we know that 22 * < h, 
that is, 2%? < logh, where h is the number of extremal points of P. The total runtime 
is therefore at most 


X en log 2% = em) 2t = en(2%** — 1) < 4cnlogh = O(n logh), 
i=0 i=0 


for some constant c € R. In summary, we obtain the following theorem. 


Theorem 3.28 The convex hull of a set P C R? of n points can be computed using 
O(nlog h) geometric operations, where h is the number of convex hull vertices. 
Questions 


6. How is convexity defined? What is the convex hull of a set in Rt? Give at 
least three possible definitions. 


7. What does it mean to compute the conver hull of a set of points in R?? Discuss 
input and expected output and possible degeneracies. 


8. How can the conver hull of a set of n points in R? be computed efficiently? 
Describe and analyze (incl. proofs) Jarvis’ Wrap, Successive Local Repair, and 
Chan’s Algorithm. 


9. Is there a linear time algorithm to compute the convex hull of n points in IR? ? 
Prove the lower bound and define/explain the model in which it holds. 


10. Which geometric primitive operations are used to compute the convez hull of 
n points in R?? Explain the two predicates and how to compute them. 
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Remarks. The sections on Jarvis’ Wrap and Graham Scan are based on material that 
Emo Welzl prepared for a course on “Geometric Computing” in 2000. 
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Chapter 4 
Plane Graphs and the DCEL 


So far we have been talking about geometric structures such as triangulations of polygons 
without paying much attention to how to represent them. This will change now. As 
many of these structures can be regarded as plane graphs, we start by reviewing a bit of 
terminology from (planar) graph theory. We assume that this is—at least somewhat— 
familiar ground for you. If you would like to see more details and examples, please refer 
to any standard textbook on graph theory, such as Bondy and Murty [2], Diestel [3], or 
West [10]. 

A (simple undirected) graph is a pair G = (V,E) where V is a finite set of vertices 
and E is the set of edges, E C u := {{u,v} | u,v € V,u Æ v}. For brevity, one often 
writes uv rather than {u,v} to denote an edge. The two vertices defining an edge are 
adjacent to each other and they are incident to the edge. 

For a vertex v € V, denote by Ng(v) the neighborhood of v in G, that is, the 
set of vertices from G that are adjacent to v. Similarly, for a set W C V of vertices 
define Ne (W) := Uwew Na(w). The degree degg(v) of a vertex v € V is the size of 
its neighborhood, that is, the number of edges from E incident to v. The subscript is 
often omitted when it is clear to which graph it refers to. As every edge is incident to 
exactly two vertices, we have the following simple but useful relationship, often called 
handshaking-lemma: } „<y deg(v) = 2]E]. 

A walk in a graph G is a sequence W = (v1, ..., Vk), k € N, of vertices such that vi 
and vi,1 are adjacent in G, forall 1 <i < k. A path is a walk whose vertices are pairwise 
distinct. A cycle is a walk whose vertices are pairwise distinct, except for vı = vk. A 
graph is connected, if there is a path between any pair of vertices. If a graph is not 
connected, it is disconnected. A disconnected graph can be decomposed into maximal 
connected subgraphs, its (connected) components. 

A tree is a connected graph that does not have any cycle. It is not hard to show that 
trees on n vertices are exactly the graphs on n vertices that have n — 1 edges. 

In geometry we are typically concerned with graphs that are embedded on some 
surface, here R?. An embedding or drawing is just a “nice” mapping of a graph G into 
the plane. More formally, each vertex v is mapped to a distinct point @(v) € R? (that 
is, y : V > R? is injective) and each edge uv is mapped to an arc—a simple Jordan 
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curve—q (uv) C R? from p(u) to (v) such that no vertex is mapped to the relative 
interior of p(uv). 

A graph is planar if it admits a crossing-free drawing, that is, a drawing in which 
no two arcs intersect except at common endpoints. For example, K, (the complete graph 
on four vertices) is planar, as demonstrated by the drawing shown in Figure 


(a) Crossing-free drawing. (b) Straight-line drawing. 


Figure 4.1: Embeddings of K, into the plane. 


If a graph is planar, then by Fáry-Wagner's Theorem [4] [8] there also exists a straight- 
line drawing, that is, a drawing in which all arcs are line segments. In order to obtain 
such a drawing for K4, we have to put one vertex inside the convex hull of the other 
three, see Figure [4.1b} 

Sometimes we refer to graphs not as abstract graphs but in a concrete embedding. 
If this embedding is a crossing-free embedding into the plane, the embedded graph is 
referred to as a plane graph. Note the distinction between “planar” and “plane”: The 
former refers to an abstract graph and expresses the possibility of a crossing-free drawing, 
whereas the latter refers to a geometric object that is a concrete crossing-free drawing of 
a graph in the plane. 

A plane straight-line graph (PSLG) is a crossing-free straight-line drawing of a graph 
in the plane. Both graphs in Figure are plane, but only the one shown on the right 
is also a plane straight-line graph. 


4.1 The Euler Formula 


If we remove all vertices (points) and edges (arcs) of a plane graph G from R?, then what 
remains is a finite collection of open sets. These sets are referred to as the faces of G. 
For instance, both plane graphs in Figure [4.1] have 4 vertices, 6 edges and 4 faces (one 
of which is unbounded). In general, if |V| is the number of vertices of a connected plane 
graph, |E| its number of edges and |F| the number of faces, then the Euler Formula states 
that 


IVI — JE] + IF| = 2. 
In the example, we get 4—6 +4=2. 
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If you do not insist on being too formal, the proof is simple and works by induction 
on the number of edges. If we fix the number of vertices, the base case occurs for |V|— 1 
edges where the plane graph is a tree. Then we have |F| = 1 and the formula holds. 
A graph with more edges always contains a cycle and therefore at least one bounded 
face. Choose one edge from a bounded face and remove it. The resulting graph is still 
connected and has one edge less but also one face less since the edge removal merges 
the bounded face with another one. Consequently, since the Euler Formula holds for the 
smaller graph by induction, it also holds for the larger graph. 

The Euler Formula can be used to prove the following important fact about planar 
graphs. 


Lemma 4.1 Any planar graph on n > 3 vertices has at most 3n—6 edges and at most 
2n — 4 faces. 


This lemma shows that the overall complexity—the sum of the number of vertices, edges, 
and faces—of a planar graph is linear in n. A planar graph with a maximal number 
(3n — 6) of edges is called mazimal planar. 


Exercise 4.2 Prove Lemma|¿.1] using the Euler formula. 
Exercise 4.3 Prove that every planar graph has a vertex of degree at most 5. 


Exercise 4.4 Show that Ks (the complete graph on five vertices) 1s not planar. 


4.2 The Doubly-Connected Edge List 


Many algorithms in computational geometry work with plane graphs. 'The doubly- 
connected edge list (DCEL) is a data structure to represent a plane graph in such a way 
that it is easy to traverse and to manipulate. In order to avoid unnecessary complications, 
let us discuss only connected graphs here that contain at least two vertices. It is not hard 
to extend the data structure to cover all plane graphs. For simplicity we also assume 
that we deal with a straight-line embedding and so the geometry of edges is defined by 
the mapping of their endpoints already. For more general embeddings, the geometric 
description of edges has to be stored in addition. 

The main building block of a DCEL is a list of halfedges. Every actual edge is 
represented by two halfedges going in opposite direction, and these are called twins, see 
Figure Along the boundary of each face, halfedges are oriented counterclockwise. 

A DCEL stores a list of halfedges, a list of vertices, and a list of faces. These lists are 
unordered but interconnected by various pointers. A vertex v stores a pointer halfedge(v) 
to an arbitrary halfedge originating from v. Every vertex also knows its coordinates, that 
is, the point point(v) it is mapped to in the represented embedding. A face f stores a 
pointer halfedge(f) to an arbitrary halfedge within the face. A halfedge h stores five 
pointers: 
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next(h) 2. ----. target(h) 


twin(h) 
face(h) e 


an o 
Figure 4.2: A halfedge in a DCEL. 


e a pointer target(h) to its target vertex, 


== 


e a pointer face(h) to the incident face, 


e a pointer twin(h) to its twin halfedge, 


e a pointer next(h) to the halfedge following h along the boundary of face(h), and 


A 


e a pointer prev(h) to the halfedge preceding h along the boundary of face(h). 


A constant amount of information is stored for every vertex, (half-)edge, and face of the 
graph. Therefore the whole DCEL needs storage proportional to |V| + [El + |F|, which is 
O(n) for a plane graph with n vertices by Lemma [4.1] 

This information is sufficient for most tasks. For example, traversing all edges around 
a face f can be done as follows: 


s — halfedge(f) 
hes 
do 
something with h 
h + next(h) 
while h Æ s 


Exercise 4.5 Give pseudocode to traverse all edges incident to a given vertex v of a 
DCEL. 


Exercise 4.6 Why is the previous halfedge prev(-) stored explicitly and the source ver- 
tex of a halfedge is not? 


4.2.1 Manipulating a DCEL 


In many geometric algorithms, plane graphs appear not just as static objects but rather 
they evolve over the course of the algorithm. Therefore the data structure used to 
represent the graph must allow for efficient update operations to change it. 
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First of all, we need to be able to generate new vertices, edges, and faces, to be added 
to the corresponding list within the DCEL and—symmetrically—the ability to delete an 
existing entity. Then it should be easy to add a new vertex v to the graph within some 
face f. As we maintain a connected graph, we better link the new vertex to somewhere, 
say, to an existing vertex u. For such a connection to be possible, we require that the 
open line segment uv lies completely in f. 

Of course, two halfedges are to be added connecting u and v. But where exactly? 
Given that from a vertex and from a face only some arbitrary halfedge is directly accessi- 
ble, it turns out convenient to use a halfedge in the interface. Let h denote the halfedge 
incident to f for which target(h) = u. Our operation becomes then (see also Figure 


add-vertex-at(v, h) 

Precondition: the open line segment point(v)point(u), where u := target(h), 
lies completely in f := face(h). 

Postcondition: a new vertex v has been inserted into f, connected by an edge 


We 
h2 f 
oN 


a 


(a) before (b) after 


Figure 4.3: Add a new vertex connected to an existing vertez u. 


and it can be realized by manipulating a constant number of pointers as follows. 


add-vertex-at(v, h) { 
hı + a new halfedge 
hə + a new halfedge 
halfedge(v) — hy 
twin(h,) + hg 
twin(h>) + hy 
target(h,) E v 
target(h2)  u 
face(h,) + f 
face 
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prev(h2) — hy 
next(h) + hy 
prev(next(h2)) < hy 


Similarly, it should be possible to add an edge between two existing vertices u and v, 
provided the open line segment uv lies completely within a face f of the graph, see 
Figure Since such an edge insertion splits f into two faces, the operation is called 
split-face. Again we use the halfedge h that is incident to f and for which target(h) = u. 


S > > 


Po before (b) after 


Figure 4.4: Split a face by an edge uv. 


Our operation becomes then 


split-face(h, v) 

Precondition: v is incident to f := face(h) but not adjacent to u := target(h). 
The open line segment point(v)point(u) lies completely in f. 

Postcondition: f has been split by a new edge uv. 


The implementation is slightly more complicated compared to add-vertex-at above, be- 
cause the face f is destroyed and so we have to update the face information of all incident 
halfedges. In particular, this is not a constant time operation, but its time complexity 
is proportional to the size of f. 


split-face(h, v) { 
fı E a new face 
fə < a new face 
hı + a new halfedge 
hz — a new halfedge 
halfedge(f,) — hy 
halfedge(f2) — hə 
twin(hi) < ha 
twin(h2) + hy 
target(h,) E v 
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target(h2)  u 
next(h2) — next(h) 
prev(next(h2)) — h2 
prev(h,) — h 
next(h) < hı 
it he 
loop 

face(i) — f2 

if target(i) = v break the loop 

i + next(i) 
endloop 
next(h ,) < next(i) 
prev(next(h,)) — hı 
next(i) < hg 
prev(h2) <i 
it hy 
do 

face(i) + fı 

i + next(i) 
until target(i) = u 
delete the face f 


} 


In a similar fashion one can realize the inverse operation join-face(h) that removes the 
edge (represented by the halfedge) h, thereby joining the faces face(h) and face(twin(h)). 

It is easy to see that every connected plane graph on at least two vertices can be 
constructed using the operations add-vertex-at and split-face, starting from an embedding 
of Kz (two vertices connected by an edge). 


Exercise 4.7 Give pseudocode for the operation join-face(h). Also specify precondi- 
tions, if needed. 


Exercise 4.8 Give pseudocode for the operation split-edge(h), that splits the edge (rep- 
resented by the halfedge) h into two by a new vertez w, see Figure 4. 5] 


4.2.2 Graphs with Unbounded Edges 


In some cases it is convenient to consider plane graphs, in which some edges are not 
mapped to a line segment but to an unbounded curve, such as a ray. This setting is not 
really much different from the one we studied before, except that one vertex is placed 
“at infinity”. One way to think of it is in terms of stereographic projection: Imagine 
R? being the x/y-plane in R? and place a unit sphere S such that its southpole touches 
the origin. We obtain a bijective continuous mapping between R? and S \{n}, where n 
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J \ IN 
\ a fi Ny fy 
(a) before (b) after 


Figure 4.5: Splat an edge by a new verter. 


is the northpole of S, as follows: A point p € R? is mapped to the point p’ that is the 
intersection of the line through p and n with S, see Figure [4.6] The further away a point 


n 
E n 
i i A 
P 0 q 


(a) Three-dimensional view. (b) Cross-section view. 


Figure 4.6: Stereographic projection. 


in R? is from the origin, the closer its image on S gets to n. But there is no way to reach 
n except in the limit. Therefore, we can imagine drawing the graph on S instead of in 
R? and putting the “infinite vertex” at n. 

All this is just for the sake of a proper geometric interpretation. As far as a DCEL 
representation of such a graph is concerned, there is no need to consider spheres or, in 
fact, anything beyond what we have discussed before. The only difference to the case 
with all finite edges is that there is this special infinite vertex, which does not have any 
point/coordinates associated to it. But other than that, the infinite vertex is treated 
in exactly the same way as the finite vertices: it has in— and outgoing halfedges along 
which the unbounded faces can be traversed (Figure [4.7). 


4.2.3 Remarks 


It is actually not so easy to point exactly to where the DCEL data structure originates 
from. Often Muller and Preparata [7] are credited, but while they use the term DCEL, 
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AN 


i 
LEZ. 


Figure 4.7: A DCEL with unbounded edges. Usually, we will not show the infinite 
verter and draw all edges as straight-line segments. This yields a geo- 
metric drawing, like the one within the gray boz. 


the data structure they describe is different from what we discussed above and from what 
people usually consider a DCEL nowadays. Overall, there is a large number of variants 
of this data structure, which appear under the names winged edge data structure [1], 
halfedge data structure [9], or quad-edge data structure [5]. Kettner [6] provides a 
comparison of all these and some additional references. 


Questions 


11. What are planar/plane graphs and straight-line embeddings? Give the defini- 
tions and explain the difference between planar and plane. 


12. How many edges can a planar graph have? What is the average vertex degree 
in a planar graph? Explain Euler’s formula and derive your answers from it (see 


Exercise and [4.3). 


13. How can plane graphs be represented on a computer? Explain the DCEL data 
structure and how to work with it. 
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Chapter 5 


Line Sweep 


In this chapter we will discuss a simple and widely applicable paradigm to design ge- 

ometric algorithms: the so-called Line-Sweep (or Plane-Sweep) technique. It can be 

used to solve a variety of different problems, some examples are listed below. The first 

part may come as a reminder to many of you, because you should have heard something 

about line-sweep in one of the basic CS courses already. However, we will soon proceed 

and encounter a couple of additional twists that were most likely not covered there. 
Consider the following geometric problems. 


Problem 5.1 (Simple Polygon Test) Given a sequence P = 
(pi,---,Pn) of points in R?, does P describe the boundary 
of a simple polygon? 


Problem 5.2 (Polygon Intersection) Given two simple poly- 
gons P and Q in R? as a (counterclockwise) sequence of 
their vertices, is P N Q = 0? 


Problem 5.3 (Segment Intersection Test) Given a set S of n 
closed line segments in R?, do any two of them intersect? 


Remark: In principle it is clear what is meant by “two segments intersect”. But there 
are a few special cases that one may have to consider carefully. For instance, does it count 
if an endpoint lies on another segment? What if two segments share an endpoint? What 
about overlapping segments and segments of length zero? In general, let us count all 
these as intersections. However, sometimes we may want to exclude some of these cases. 
For instance, in a simple polygon test, we do not want to consider the shared endpoint 
between two consecutive edges of the boundary as an intersection. 


Problem 5.4 (Segment Intersections) Given a set S of n < 
closed line segments in R?, compute all pairs of segments > 
that intersect. yy Sa 
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duced by S, that is, the subdivision of R? induced by S. 


Problem 5.5 (Segment Arrangement) Given a set S of n 
closed line segments in R?, construct the arrangement in- wee > oe 


R?, construct the arrangement induced by SUT. 


Problem 5.6 (Map Overlay) Given two sets S and T of n and 
m, respectively, pairwise interior disjoint line segments in Pas = 2 
In the following we will use Problem [5.4] as our flagship example. 


Trivial Algorithm. ‘Test all the (5) pairs of segments from S in O(n?) time and O(n) space. 
For Problem [5.4]this is worst-case optimal because there may by Q(n?) intersecting pairs. 

But in case that the number of intersecting pairs is, say, linear in n there is still hope 
to obtain a subquadratic algorithm. Given that there is a lower bound of O(nlog n) for 
Element Uniqueness (Given X1,...,Xn € R, is there ani 4 j such that x; = xj?) in the 
algebraic computation tree model, all we can hope for is an output-sensitive runtime of 
the form O(nlogn + k), where k denotes the number of intersecting pairs (output size). 


5.1 Interval Intersections 
As a warmup let us consider the corresponding problem in R+. 


Problem 5.7 Given a set I of n intervals [l;,r¡] C R, 1 <i < n. Compute all pairs of 
intervals from I that intersect. 


Theorem 5.8 Problem|5. 7 can be solved in O(nlogn+k) time and O(n) space, where 
k is the number of intersecting pairs from Ge 


Proof. First observe that two real intervals intersect if and only if one contains the right 
endpoint of the other. 

Sort the set ((f,,0)|1 <i < nju((r;,1)/|1 < i < n} in increasing lexicographic order 
and denote the resulting sequence by P. Store along with each point from P its origin 
(i). Walk through P from start to end while maintaining a list L of intervals that contain 
the current point p € P. 

Whenever p = (£;,0), 1 < i < n, insert i into L. Whenever p = (r;,,1),1<i<mn, 
remove i from L and then report for all j € L the pair {i, j} as intersecting. O 


5.2 Segment Intersections 
How can we transfer the (optimal) algorithm for the corresponding problem in R+ to the 


plane? In R! we moved a point from left to right and at any point resolved the situation 
locally around this point. More precisely, at any point during the algorithm, we knew all 
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intersections that are to the left of the current (moving) point. A point can be regarded 
a hyperplane in Rt, and the corresponding object in R? is a line. 


General idea. Move a line £ (so-called sweep line) from left to right over the plane, such 
that at any point during this process all intersections to the left of £ have been reported. 


Sweep line status. The list of intervals containing the current point corresponds to a list L 
of segments (sorted by y-coordinate) that intersect the current sweep line £. This list L is 
called sweep line status (SLS). Considering the situation locally around L, it is obvious 
that only segments that are adjacent in L can intersect each other. This observation 
allows to reduce the overall number of intersection tests, as we will see. In order to 
allow for efficient insertion and removal of segments, the SLS is usually implemented as 
a balanced binary search tree. 


Event points. ‘The order of segments in SLS can change at certain points only: whenever 
the sweep line moves over a segment endpoint or a point of intersection of two segments 
from S. Such a point is referred to as an event point (EP) of the sweep. Therefore we 
can reduce the conceptually continuous process of moving the sweep line over the plane 
to a discrete process that moves the line from EP to EP. This discretization allows for 
an efficient computation. 

At any EP several events can happen simultaneously: several segments can start 
and/or end and at the same point a couple of other segments can intersect. In fact 
the sweep line does not even make a difference between any two event points that have 
the same x-coordinate. To properly resolve the order of processing, EPs are considered 
in lexicographic order and wherever several events happen at a single point, these are 
considered simultaneously as a single EP. In this light, the sweep line is actually not a 
line but an infinitesimal step function (see Figure 5.1). 


Event point schedule. In contrast to the one-dimensional situation, in the plane not all 
EP are known in advance because the points of intersection are discovered during the 
algorithm only. In order to be able to determine the next EP at any time, we use a 
priority queue data structure, the so-called event point schedule (EPS). 

Along with every EP p store a list end(p) of all segments that end at p, a list begin(p) 
of all segments that begin at p, and a list int(p) of all segments in SLS that intersect at 
p a segment that is adjacent to it in SLS. 

Along with every segment we store pointers to all its appearances in an int(-) list of 
some EP. As a segment appears in such a list only if it intersects one of its neighbors 
there, every segment needs to store at most two such pointers. 
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(a) Before. (b) After. 


Figure 5.1: Handling an event point p. Ending segments are shown red (dashed), 
starting segments green (dotted), and passing segments blue (solid). 


Invariants. The following conditions can be shown to hold before and after any event 
point is handled. (We will not formally prove this here.) In particular, the last condition 
at the end of the sweep implies the correctness of the algorithm. 


1. L is the sequence of segments from S that intersect £, ordered by y-coordinate of 
their point of intersection. 


2. E contains all endpoints of segments from S and all points where two segments that 
are adjacent in L intersect to the right of £. 


3. All pairs from ($) that intersect to the left of £ have been reported. 


Event point handling. An EP p is processed as follows. 
1. If end(p) Uint(p) = Ø, localize p in L. 
2. Report all pairs of segments from end(p) U begin(p) Uint(p) as intersecting. 
3. Remove all segments in end(p) from L. 
4. Reverse the subsequence in L that is formed by the segments from int(p). 
5. Insert segments from begin(p) into L, sorted by slope. 


6. Test the topmost and bottommost segment in L from begin(p) Uint(p) for intersec- 
tion with its successor and predecessor, respectively, and update EP if necessary. 
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Updating EPS. Insert an EP p corresponding to an intersection of two segments s and 
t. Without loss of generality let s be above t at the current position of £. 


1. If p does not yet appear in E, insert it. 


2. If s is contained in an int(-) list of some other EP q, where it intersects a segment 
t' from above: Remove both s and t’ from the int(-) list of q and possibly remove 
q from E (if end(q) U begin(q) U int(q) = Ø). Proceed analogously in case that t 
is contained in an int(-) list of some other EP, where it intersects a segment from 
below. 


3. Insert s and t into int(p). 


Sweep. 
1. Insert all segment endpoints into begin(-)/end(-) list of a corresponding EP in E. 


2. As long as E #9), handle the first EP and then remove it from E. 


Runtime analysis. Initialization: O(nlogn). Processing of an EP p: 
O(#intersecting pairs + lend(p)|log n + |int(p)| + |begin(p)|log n + logn). 


In total: O(k+nlogn+klog n) = O((n+k) log n), where k is the number of intersecting 
pairs in S. 


Space analysis. Clearly |S| < n. At begin we have |E| < 2n and |S| = 0. Furthermore the 
number of additional EPs corresponding to points of intersection is always bounded by 
2|S|. Thus the space needed is O(n). 


Theorem 5.9 Problem|5.4) and Problem|5. 5] can be solved in O((n+k)logn) time and 
O(n) space. O 


Theorem 5.10 Problem|5. 1) Problem|5.2] and Problem|5.3] can be solved in O(nlog n) 
tume and O(n) space. O 


Exercise 5.11 Flesh out the details of the sweep line algorithm for Problem|5.Q that is 
referred to in Theorem|5.10, What if you have to construct the intersection rather 


than just to decide whether or not 1t 1s empty? 


Exercise 5.12 You are given n azis—parallel rectangles in R? with their bottom sides 
lying on the x-azts. Construct their union in O(nlogn) time. 
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5.3 Improvements 


The basic ingredients of the line sweep algorithm go back to work by Bentley and 
Ottmann from 1979. The particular formulation discussed here, which takes all 
possible degeneracies into account, is due to Mehlhorn and Naher [9]. 

Theorem is obviously optimal for Problem because this is just the 2- 
dimensional generalization of Element Uniqueness (see Section [1.1). One might sus- 
pect there is also a similar lower bound of O(nlog n) for Problem and Problem 
However, this is not the case: Both can be solved in O(n) time, albeit using a very 
complicated algorithm of Chazelle (the famous triangulation algorithm). 

Similarly, it is not clear why O(n log n+k) time should not be possible in Theorem]j5.9] 
Indeed this was a prominent open problem in the 1980's that has been solved in several 
steps by Chazelle and Edelsbrunner in 1988. The journal version of their paper 
consists of 54 pages and the space usage is suboptimal O(n + k). 

Clarkson and Shor [6] and independently Mulmuley [10) [11] described randomized 
algorithms with expected runtime O(nlogn+k) using O(n) and O(n + k), respectively, 
space. 

An optimal deterministic algorithm, with runtime O(nlogn + k) and using O(n) 
space, is known since 1995 only due to Balaban [1]. 


5.4 Algebraic degree of geometric primitives 


We already have encountered different notions of complexity during this course: We can 
analyze the time complexity and the space complexity of algorithms and both usually 
depend on the size of the input. In addition we have seen examples of output-sensitive 
algorithms, whose complexity also depends on the size of the output. In this section, 
we will discuss a different kind of complexity that is the algebraic complexity of the 
underlying geometric primitives. In this way, we try to shed a bit of light into the 
bottom layer of geometric algorithms that is often swept under the rug because it consists 
of constant time operations only. Nevertheless, it would be a mistake to disregard this 
layer completely, because in most—if not every—real world application it plays a crucial 
role, by affecting efficiency as well as correctness. Regarding efficiency, the value of the 
constants involved can make a big difference. The possible effects on correctness will 
hopefully become clear in the course of this section. 

In all geometric algorithms there are some fundamental geometric predicates and/or 
constructions on the bottom level. Both are operations on geometric objects, the differ- 
ence is only in the result: The result of a construction is a geometric object (for instance, 
the point common to two non-parallel lines), whereas the result of a predicate is Boolean 
(true or false). 

Geometric predicates and constructions in turn are based on fundamental arithmetic 
operations. For instance, we formulated planar convex hull algorithms in terms of an 
orientation predicate—given three points p, q,r € R?, is r strictly to the right of the ori- 
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ented line pr— which can be implemented using multiplication and addition/subtraction 
of coordinates/numbers. When using limited precision arithmetic, it is important to keep 
an eye on the size of the expressions that occur during an evaluation of such a predicate. 

In Exercise we have seen that the orientation predicate can be computed by 
evaluating a polynomial of degree two in the input coordinates and, therefore, we say 
that 


Proposition 5.13 The rightturn/orientation predicate for three points in R? has alge- 
braic degree two. 


The degree of a predicate depends on the algebraic expression used to evaluate it. Any 
such expression is intimately tied to the representation of the geometric objects used. 
Where not stated otherwise, we assume that geometric objects are represented as de- 
scribed in Section So in the proposition above we assume that points are repre- 
sented using Cartesian coordinates. The situation might be different, if, say, a polar 
representation is used instead. 

But even once a representation is agreed upon, there is no obvious correspondence to 
an algebraic expression that describes a given predicate. In fact, it is not even clear why 
such an expression should exist in general. But if it does—as, for instance, in case of the 
orientation predicate—we prefer to work with a polynomial of smallest possible degree. 
Therefore we define the (algebraic) degree of a geometric predicate as the minimum 
degree of a polynomial that defines it (predicate true <=> polynomial positive). So 
there still is something to be shown in order to complete Proposition [5.13] 


Exercise 5.14 Show that there 1s no polynomial of degree at most one that describes 
the orientation test in R?. 

Hint: Let p = (0,0), q = (x,0), and r = (0,y) and show that there is no linear 
function in x and y that distinguishes the region where pqr form a rightturn from 
its complement. 


The degree of a predicate corresponds to the size of numbers that arise during its 
evaluation. If all input coordinates are k-bit integers then the numbers that occur during 
an evaluation of a degree d predicate on these coordinates are of size aboutt] dk. If the 
number type used for the computation can represent all such numbers, the predicate 
can be evaluated exactly and thus always correctly. For instance, when using a standard 
IEEE double precision floating point implementation which has a mantissa length of 53 
bit then the above orientation predicate can be evaluated exactly if the input coordinates 
are integers between 0 and 2”°, say. 

Let us now get back to the line segment intersection problem. It needs a few new 
geometric primitives: most prominently, constructing the intersection point of two line 
segments. 


1It is only about dk because not only multiplications play a role but also additions. As a rule of thumb, 
a multiplication may double the bitsize, while an addition may increase it by one. 
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Two segments. Given two line segments s = Aa + (1 — à)b, à € [0,1] and t = uc + (1 — 
u)d, u € [0,1], it is a simple exercise in linear algebra to compute s N t. Note that s Nt 
is either empty or a single point or a line segment. 

For a = (ax, ay), b = (bx, by), € = (cx, cy), and d = (dx, dy) we obtain two linear 
equations in two variables A and u. 


Rearranging terms yields 


Ma, bx) f u(dx Cx) = dy by 
Alay = by) + uldu = ey) = dy = by 


Assuming that the lines underlying s and t have distinct slopes (that is, they are neither 
identical nor parallel) we have 


a, dy 1 0 
D= a,—b, d,—Cx L by by 1 0 40 
dy — by dy — Cy Cx Cy 0 1 
dx dy 0 1 
and using Cramer’s rule 
nes Ax Dx dk — Cx wA _1|ax—bx dy — bx 
= D| dy—by dy—cy BED ay = dy dy—by 


To test if s and t intersect, we can—after having sorted out the degenerate case in 
which both segments have the same slope—compute A and u and then check whether 
A, u € [0,1]. 

Observe that both A and D result from multiplying two differences of input coor- 
dinates. Computing the x-coordinate of the point of intersection via b, + A(a, — bx) 
uses another multiplication. Overall this computation yields a fraction whose numerator 
is a polynomial of degree three in the input coordinates and whose denominator is a 
polynomial of degree two in the input coordinates. 

In order to maintain the sorted order of event points in the EPS, we need to compare 
event points lexicographically. In case that both are intersection points, this amounts 
to comparing two fractions of the type discussed above. In order to formulate such a 
comparison as a polynomial, we have to cross-multiply the denominators, and so obtain 
a polynomial of degree 3+ 2 = 5. It can be shown (but we will not do it here) that this 
bound is tight and so we conclude that 


Proposition 5.15 The algebraic degree of the predicate that compares two intersection 
points of line segments lexicographically ts five. 
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Therefore the coordinate range in which this predicate can be evaluated exactly using 
IEEE double precision numbers shrinks down to integers between 0 and about 2*% = 
1'024. 


Exercise 5.16 What is the algebraic degree of the predicate checking whether two line 
segments intersect? (Above we were interested in the actual intersection point, 
but now we consider the predicate that merely answers the question whether two 
segments intersect or not by yes or no). 


Exercise 5.17 The goal of this exercise 1s to analyze the algebraic degree of the In- 
Circle predicate: Given three points p, q,r € R? that define a circle C and a fourth 
point s, ts s located inside C or not? To this end we employ the so-called parabolic 
lifting map, which will also play a prominent role in the next chapter of the course. 


a) The parabolic lifting map £: R? — R? is defined for a point p = (x,y) € R? by 
{(p) = (x,y, x? +y?). For a circle C C R? of positive radius, show that the ‘lifted 
circle” €(C) = {€(p) | p € C} is contained in a unique plane hc C RÌ. Moreover, a 
point p € R? is strictly inside (outside, respectively) of C if and only if the lifted 
point l(p) is strictly below (above, respectively) hc. 


b) What is the degree of the polynomial you need to evaluate in order to decide 
the InCircle predicate? 


5.5 Red-Blue Intersections 


Although the Bentley-Ottmann sweep appears to be rather simple, its implementation 
is not straightforward. For once, the original formulation did not take care of possible 
degeneracies—as we did in the preceding section. But also the algebraic degree of the 
predicates used is comparatively high. In particular, comparing two points of intersection 
lexicographically is a predicate of degree five, that is, in order to compute it, we need to 
evaluate a polynomial of degree five. When evaluating such a predicate with plain floating 
point arithmetic, one easily gets incorrect results due to limited precision roundoff errors. 
Such failures frequently render the whole computation useless. The line sweep algorithm 
is problematic in this respect, as a failure to detect one single point of intersection often 
implies that no other intersection of the involved segments to the right is found. 

In general, predicates of degree four are needed to construct the arrangement of line 
segments because one needs to determine the orientation of a triangle formed by three 
segments. This is basically an orientation test where two points are segment endpoints 
and the third point is an intersection point of segments. Given that the coordinates of 
the latter are fractions, whose numerator is a degree three polynomial and the common 
denominator is a degree two polynomial, we obtain a degree four polynomial overall. 

Motivated by the Map overlay application we consider here a restricted case. In the 
red-blue intersection problem, the input consists of two sets R (red) and B (blue) of 
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segments such that the segments in each set are interzor-disjoint, that is, for any pair of 
distinct segments their relative interior is disjoint. In this case there are no triangles of 
intersecting segments, and it turns out that predicates of degree two suffice to construct 
the arrangement. This is optimal because already the intersection test for two segments 
is a predicate of degree two. 


Predicates of degree two. Restricting to degree two predicates has certain consequences. 
While it is possible to determine the position of a segment endpoint relative to a(nother) 
segment using an orientation test, one cannot, for example, compare a segment endpoint 
with a point of intersection lexicographically. Even computing the coordinates for a 
point of intersection is not possible. Therefore the output of intersection points is done 
implicitly, as “intersection of s and t”. 

Graphically speaking we can deform any segment—keeping its endpoints fixed—as 
long as it remains monotone and it does not reach or cross any segment endpoint (it 
did not touch before). With help of degree two predicates there is no way to tell the 
difference. 


Witnesses. Using such transformations the processing of intersection points is deferred as 
long as possible (lazy computation). The last possible point w(s, t) where an intersection 
between two segments s and t has to be processed we call the witness of the intersection. 
The witness w(s,t) is the lexicographically smallest segment endpoint that is located 
within the closed wedge formed by the two intersecting segments s and t to the right 
of the point of intersection (Figure 5.2). Note that each such wedge contains at least 
two segment endpoints, namely the right endpoints of the two intersecting segments. 
Therefore for any pair of intersecting segments its witness is well-defined. 


t 


Figure 5.2: The witness w(s,t) of an intersection s Nt. 


As a consequence, only the segment endpoints are EPs and the EPS can be determined 
by lexicographic sorting during initialization. On the other hand, the SLS structure gets 
more complicated because its order of segments does not necessarily reflect the order in 
which the segments intersect the sweep line. 


Invariants. The invariants of the algorithm have to take the relaxed notion of order into 
account. Denote the sweep line by £. 
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1. L is the sequence of segments from S = RUB intersecting f; s appears before t in 
L => s intersects l above t or s intersects t and the witness of this intersection is 
to the right of £. 


2. All intersections of segments from S whose witness is to the left of £ have been 
reported. 


SLS Data Structure. The SLS structure consist of three levels. We use the fact that 
segments of the same color do not interact, except by possibly sharing endpoints. 


1. Collect adjacent segments of the same color in bundles, stored as balanced search 
trees. For each bundle store pointers to the topmost and bottommost segment. 
(As the segments within one bundle are interior-disjoint, their order remains static 
and thus correct under possible deformations due to lazy computation.) 


2. All bundles are stored in a doubly linked list, sorted by y-coordinate. 


3. All red bundles are stored in a balanced search tree (bundle tree). 


Bundle 
Tree 


t 4 | 
1 
| 
| 
| 


List 


Figure 5.3: Graphical representation of the SLS data structure. 


The search tree structure should support insert, delete, split and merge in (amortized) 
logarithmic time each. For instance, splay trees meet these requirements. (If you 
have never heard about splay trees so far, you do not need to know for our purposes 
here. Just treat them as a black box. However, you should take a note and read about 
splay trees still. They are a fascinating data structure.) 


EP handling. An EP p is processed as follows. 


1. We first want to classify all bundles with respect to their position relative to p: as 
either lying above or below p or as ending at p. There are < 2 bundles that have 
no clear such characterization. 
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(a) Localize p in bundle tree > Find < 2 bundles without clear characteriza- 
tion. For the localization we use the pointers to the topmost and bottommost 
segment within a bundle. 


(b) Localize p in < 2 red bundles found and split them at p. (If p is above the 
topmost or below the bottommost segment of the bundle, there is no split.) 
All red bundles are now either above, ending, or below with respect to p. 


(c) Localize p within the blue bundles by linear search. 


(d) Localize p in the < 2 blue bundles found and split them at p. (If p is above the 
topmost or below the bottommost segment of the bundle, there is no split.) 
All bundles are now either above, ending, or below with respect to p. 


2. Run through the list of bundles around p. More precisely, start from one of the 
bundles containing p found in Step 1 and walk up in the doubly linked list of 
bundles until two successive bundles (hence of opposite color) are both above p. 
Similarly, walk down in the doubly linked list of bundles, until two successive 
bundles are both below p. Handle all adjacent pairs of bundles (A,B) that are in 
wrong order and report all pairs of segments as intersecting. (Exchange A and B 
in the bundle list and merge them with their new neighbors.) 


3. Report all two-colored pairs from begin(p) x end(p) as intersecting. 


4. Remove ending bundles and insert starting segments, sorted by slope and bundled 
by color and possibly merge with the closest bundle above or below. 


Remark: As both the red and the blue segments are interior-disjoint, at every EP there 
can be at most one segment that passes through the EP. Should this happen, for the 
purpose of EP processing split this segment into two, one ending and one starting at this 
EP. But make sure to not report an intersection between the two parts! 


Analysis. Sorting the EPS: O(nlogn) time. Every EP generates a constant number of 
tree searches and splits of O(logn) each. Every exchange in Step |2| generates at least 
one intersection. New bundles are created only by inserting a new segment or by one 
of the constant number of splits at an EP. Therefore O(n) bundles are created in total. 
The total number of merge operations is O(n), as every merge kills one bundle and O(n) 
bundles are created overall. The linear search in steps |1}and [2|can be charged either to 
the ending bundle or—for continuing bundles—to the subsequent merge operation. In 
summary, we have a runtime of O(nlogn + k) and space usage is linear obviously. 


Theorem 5.18 For two sets R and B, each consisting of intertor-disjoint line segments 
in R?, one can find all intersecting pairs of segments in O(nlogn + k) time and 
linear space, using predicates of maximum degree two. Here n = |R| + |B| and k is 
the number of intersecting pairs. 
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Remarks. 'The first optimal algorithm for the red-blue intersection problem was pub- 
lished in 1988 by Harry Mairson and Jorge Stolfi [7]. In 1994 Timothy Chan [3] described 
a trapezoidal-sweep algorithm that uses predicates of degree three only. 'The approach 
discussed above is due to Andrea Mantler and Jack Snoeyink [8] from 2000. 


Exercise 5.19 Let S be a set of n segments each of which is either horizontal or 
vertical. Describe an O(nlogn) time and O(n) space algorithm that counts the 


number of pairs in (3) that intersect. 


Questions 


14. How can one test whether a polygon on n vertices 1s simple? Describe an 
O(nlogn) time algorithm. 


15. How can one test whether two simple polygons on altogether n vertices inter- 
sect? Describe an O(nlogn) time algorithm. 


16. How does the line sweep algorithm work that finds all k intersecting pairs 
among n line segments in R?? Describe the algorithm, using O((n + k) log n) 
time and O(n) space. In particular, explain the data structures used, how event 
points are handled, and how to cope with degeneracies. 


17. Given two line segments s and t in R? whose endpoints have integer coordi- 
nates in [0,2P); suppose s and t intersect in a single point q, what can you 
say about the coordinates of q? Give a good (tight up to small additive number 
of bits) upper bound on the size of these coordinates. (No need to prove tightness, 
that is, give an example which achieves the bound.) 


18. What ts the degree of a predicate and why is it an important parameter? What 
is the degree of the orientation test and the incircle test in R?? Explain the 
term and give two reasons for its relevance. Provide tight upper bounds for the 
two predicates mentioned. (No need to prove tightness.) 


19. What 1s the map overlay problem and how can 1t be solved optimally using 
degree two predicates only? Give the problem definition and explain the term 
“interior-disjoint”. Explain the bundle-sweep algorithm, in particular, the data 
structures used, how an event point is processed, and the concept of witnesses. 
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Chapter 6 


Delaunay Triangulations 


In Chapter [2] we have discussed triangulations of simple polygons. A triangulation nicely 
partitions a polygon into triangles, which allows, for instance, to easily compute the 
area or a guarding of the polygon. Another typical application scenario is to use a 
triangulation T for interpolation: Suppose a function f is defined on the vertices of the 
polygon P, and we want to extend it “reasonably” and continuously to P”. 'Then for a 
point p € P” find a triangle t of T that contains p. As p can be written as a convex 
combination aon Aivi of the vertices v1, V2, v3 of t, we just use the same coefficients to 
obtain an interpolation f(p) := Po AMif(v;) of the function values. 

If triangulations are a useful tool when working with polygons, they might also turn 
out useful to deal with other geometric objects, for instance, point sets. But what could 
be a triangulation of a point set? Polygons have a clearly defined interior, which naturally 
lends itself to be covered by smaller polygons such as triangles. A point set does not have 
an interior, except ... Here the notion of convex hull comes handy, because it allows us 
to treat a point set as a convex polygon. Actually, not really a convex polygon, because 
points in the interior of the convex hull should not be ignored completely. But one way to 
think of a point set is as a convex polygon—its convex hull—possibly with some holes— 
which are points—in its interior. A triangulation should then partition the convex hull 
while respecting the points in the interior, as shown in the example in Figure [6.1b] 


(a) Simple polygon triangulation. (b) Point set triangulation. (c) Not a triangulation. 


Figure 6.1: Examples of (non-)triangulations. 
In contrast, the example depicted in Figure nicely subdivides the convex hull 
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but should not be regarded a triangulation: Two points in the interior are not respected 
but simply swallowed by a large triangle. 
This interpretation directly leads to the following adaption of Definition 


Definition 6.1 A triangulation of a finite point set P C R? is a collection T of triangles, 
such that 


(1) conv(P) = Utes T; 
(2) P = Utes V(T); and 


(3) for every distinct pair T,U € T, the intersection TNU is either a common 
vertez, or a common edge, or empty. 


Just as for polygons, triangulations are universally available for point sets, meaning 
that (almost) every point set admits at least one. 


Proposition 6.2 Every set P C R? ofn > 3 points has a triangulation, unless all points 
in P are collinear. 


Proof. In order to construct a triangulation for P, consider the lexicographically sorted 
sequence Pi,...,Pn Of points in P. Let m be minimal such that p;,...,Pm are not 
collinear. We triangulate pi,...,Pm by connecting Pm to all of p;,,...,pm-1 (which are 
on a common line), see Figure 


(a) Getting started. (b) Adding a point. 


Figure 6.2: Constructing the scan triangulation of P. 


Then we add Pm+1,---»Pn- When adding pi, for i > m, we connect p; with all 
vertices of C;_, := conv((p1,...,pi-1)) that it “sees”, that is, every vertex v of C;_; for 
which piv Ci- = {v}. In particular, among these vertices are the two points of tangency 
from pi to Ci_1, which shows that we always add triangles (Figure whose union 
after each step covers C;. O 


The triangulation that is constructed in Proposition|6.2|is called a scan triangulation. 
Such a triangulation (Figure |6.3al (left) shows a larger example) is usually “ugly”, though, 
since it tends to have many long and skinny triangles. This is not just an aesthetic deficit. 
Having long and skinny triangles means that the vertices of a triangle tend to be spread 
out far from each other. You can probably imagine that such a behavior is undesirable, 
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for instance, in the context of interpolation. In contrast, the Delaunay triangulation 
of the same point set (Figure [6.3b) looks much nicer, and we will discuss in the next 
section how to get this triangulation. 


(a) Scan triangulation. (b) Delaunay triangulation. 


Figure 6.3: Two triangulations of the same set of 50 points. 


Exercise 6.3 Describe an O(nlogn) time algorithm to construct a scan triangulation 
for a set of n points in R?. 


On another note, if you look closely into the SLR-algorithm to compute planar convex 
hull that was discussed in Chapter |3| then you will realize that we also could have used 
this algorithm in the proof of Proposition Whenever a point is discarded during 
SLR, a triangle is added to the polygon that eventually becomes the convex hull. 

In view of the preceding chapter, we may regard a triangulation as a plane graph: 
the vertices are the points in P and there is an edge between two points p % q, if and 
only if there is a triangle with vertices p and q. Therefore we can use Euler's formula to 
determine the number of edges in a triangulation. 


Lemma 6.4 Any triangulation of a set P C R? ofn points has exactly 3n—h—3 edges, 
where h is the number of points from P on Odconv(P). 


Proof. Consider a triangulation T of P and denote by E the set of edges and by F the 
set of faces of T. We count the number of edge-face incidences in two ways. Denote 
J={(e,f)e€ Ex F : ec of}. 

On the one hand, every edge is incident to exactly two faces and therefore |J| = 2]E]. 
On the other hand, every bounded face of T is a triangle and the unbounded face has h 
edges on its boundary. Therefore, |J| = 3([F| — 1) +h. 
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Together we obtain 3|F| = 2|E|— h + 3. Using Euler’s formula (3n — 3|E| + 3|F| = 6) 
we conclude that 3n — |El — h +3 = 6 and so |E| = 3n — h — 3. O 


In graph theory, the term “triangulation” is sometimes used as a synonym for “maxi- 
mal planar”. But geometric triangulations are different, they are maximal planar in the 
sense that no straight-line edge can be added without sacrificing planarity. 


Corollary 6.5 A triangulation of a set P C R? of n points is maximal planar, if and 
only if conv(P) is a triangle. 


Proof. Combine Lemma and Lemma O 


Exercise 6.6 Find for every n > 3 a simple polygon P with n vertices such that P has 
exactly one triangulation. P should be in general position, meaning that no three 
vertices are collinear. 


Exercise 6.7 Show that every set of n > 5 points in general position (no three points 
are collinear) has at least two different triangulations. 

Hint: Show first that every set of five points in general position contains a conver 
4-hole, that 1s, a subset of four points that span a convex quadrilateral that does 
not contain the fifth point. 


6.1 The Empty Circle Property 
We will now move on to study the ominous and supposedly nice Delaunay triangulations 
mentioned above. They are defined in terms of an empty circumcircle property for 


triangles. The circumcircle of a triangle is the unique circle passing through the three 
vertices of the triangle, see Figure 


Figure 6.4: Czrcumcircle of a triangle. 


Definition 6.8 A triangulation of a finite point set P C R? is called a Delaunay triangu- 
lation, 2f the circumcircle of every triangle is empty, that is, there is no point from 
P in its interior. 


68 


CG 2013 6.1. The Empty Circle Property 


Consider the example depicted in Figure It shows a Delaunay triangulation of a 
set of six points: The circumcircles of all five triangles are empty (we also say that the 
triangles satisfy the empty circle property). The dashed circle is not empty, but that is 
fine, since it is not a circumcircle of any triangle. 


Figure 6.5: All triangles satisfy the empty circle property. 


It is instructive to look at the case of four points in convex position. Obviously, there 
are two possible triangulations, but in general, only one of them will be Delaunay, see 
Figure and If the four points are on a common circle, though, this circle is 
empty; at the same time it is the circumcircle of all possible triangles; therefore, both 
triangulations of the point set are Delaunay, see Figure [6.6c] 


9 


(a) Delaunay triangulation. (b) Non-Delaunay triangulation. (c) Two Delaunay triangulations. 


Figure 6.6: Trzangulations of four points in convex position. 


Proposition 6.9 Given a set P C R? of four points that are in convex position but not 
cocircular. Then P has exactly one Delaunay triangulation. 


Proof. Consider a convex polygon P = pqrs. There are two triangulation of P: a 
triangulation J, using the edge pr and a triangulation J, using the edge qs. 

Consider the family C, of circles through pr, which contains the circumcircles Cy = 
pqr and C; = rsp of the triangles in J;. By assumption s is not on C;. If s is outside of 
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Cı, then q is outside of Ci: Consider the process of continuously moving from C; to Ci 
in C; (Figure|6.7a); the point q is “left behind” immediately when going beyond C, and 


ru 


only the final circle C; “grabs” the point s. 


C2 
(a) Going from Cy to Cį in Cy. (b) Going from C, to Ca in C2. 


Figure 6.7: Circumcircles and containment for triangulations of four points. 


Similarly, consider the family C, of circles through pq, which contains the circumcir- 
cles C; = pqr and C = spq, the latter belonging to a triangle in J2. As s is outside of 
C4, it follows that r is inside C2: Consider the process of continuously moving from C; 
to C2 in C2 (Figure [6.7b) ; the point r is on C, and remains within the circle all the way 
up to C2. This shows that J, is Delaunay, whereas J, is not. 

The case that s is located inside C, is symmetric: just cyclically shift the roles of 
pqrs to qrsp. O 


6.2 The Lawson Flip algorithm 


It is not clear yet that every point set actually has a Delaunay triangulation (given that 
not all points are on a common line). In this and the next two sections, we will prove 
that this is the case. The proof is algorithmic. Here is the Lawson flip algorithm for a 
set P of n points. 


1. Compute some triangulation of P (for example, the scan triangulation). 


2. While there exists a subtriangulation of four points in convex position that is not 
Delaunay (like in Figure [6.6b), replace this subtriangulation by the other triangu- 
lation of the four points (Figure [6.6a). 


We call the replacement operation in the second step a (Lawson) flip. 


Theorem 6.10 Let P C R? be a set of n points, equipped with some triangulation T. 
The Lawson flip algorithm terminates after at most de = O(n?) flips, and the 
resulting triangulation D is a Delaunay triangulation of P. 
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We will prove Theorem in two steps: First we show that the program described 
above always terminates and, therefore, is an algorithm, indeed (Section [6.3). Then we 
show that the algorithm does what it claims to do, namely the result is a Delaunay 
triangulation (Section (6.4). 

6.3 Termination of the Lawson Flip Algorithm: The Lifting Map 


In order to prove Theorem we invoke the (parabolic) lifting map. This is the 
following: given a point p = (x,y) € R?, its lifting L(p) is the point 


líp) = (x,y, x? + y?) e R®. 
Geometrically, £ “lifts” the point vertically up until it lies on the unit paraboloid 
lauz |z=x? +y7} € RS, 


see Figure 


(a) The lifting map. (b) Points on/inside/outside a circle are lifted to 
points on/below/above a plane. 


Figure 6.8: The lifting map: circles map to planes. 


Recall the following important property of the lifting map that we proved in Exercise 
It is illustrated in Figure 


Lemma 6.11 Let C C R? be a circle of positive radius. The “lifted circle” L(C) = {L(p) | 
p € C} ts contained in a unique plane hc C R?. Moreover, a point p € R? is strictly 
inside (outside, respectively) of C if and only if the lifted point L(p) is strictly below 
(above, respectively) hc. 


Using the lifting map, we can now prove Theorem Let us fix the point set P for 
this and the next section. First, we need to argue that the algorithm indeed terminates 
(if you think about it a little, this is not obvious). So let us interpret a flip operation in 
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the lifted picture. The flip involves four points in convex position in R?, and their lifted 
images form a tetrahedron in R? (think about why this tetrahedron cannot be “flat”). 

The tetrahedron is made up of four triangles; when you look at it from the top, you 
see two of the triangles, and when you look from the bottom, you see the other two. In 
fact, what you see from the top and the bottom are the lifted images of the two possible 
triangulations of the four-point set in R? that is involved in the flip. 

Here is the crucial fact that follows from Lemma The two top triangles come 
from the non-Delaunay triangulation before the flip, see Figure The reason is that 
both top triangles have the respective fourth point below them, meaning that in R?, 
the circumcircles of these triangles contain the respective fourth point—the empty circle 
property is violated. In contrast, the bottom two triangles come from the Delaunay 
triangulation of the four points: they both have the respective fourth point above them, 
meaning that in R?, the circumcircles of the triangles do not contain the respective fourth 


point, see Figure [6.9b] 


(a) Before the flip: the top two triangles of (b) After the flip: the bottom two triangles of the 
the tetrahedron and the corresponding non- tetrahedron and the corresponding Delaunay 
Delaunay triangulation in the plane. triangulation in the plane. 


Figure 6.9: Lawson flip: the height of the surface of lifted triangles decreases. 


In the lifted picture, a Lawson flip can therefore be interpreted as an operation that 
replaces the top two triangles of a tetrahedron by the bottom two ones. If we consider 
the lifted image of the current triangulation, we therefore have a surface in R? whose 
pointwise height can only decrease through Lawson flips. In particular, once an edge 
has been flipped, this edge will be strictly above the resulting surface and can therefore 
never be flipped a second time. Since n points can span at most La) edges, the bound 
on the number of flips follows. 


6.4 Correctness of the Lawson Flip Algorithm 


It remains to show that the triangulation of P that we get upon termination of the 
Lawson flip algorithm is indeed a Delaunay triangulation. Here is a first observation 
telling us that the triangulation is “locally Delaunay”. 
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Observation 6.12 Let A, A’ be two adjacent triangles in the triangulation D that results 
from the Lawson flip algorithm. Then the circumcircle of A does not have any 
verter of A’ in its interior, and vice versa. 


If the two triangles together form a convex quadrilateral, this follows from the fact 
that the Lawson flip algorithm did not flip the common edge of A and A’. If the four 
vertices are not in convex position, this is basic geometry: given a triangle A, its cir- 
cumcircle C can only contains points of C \ A that form a convex quadrilateral with the 
vertices of A. 

Now we show that the triangulation is also “globally Delaunay”. 


Proposition 6.13 The triangulation D that results from the Lawson flip algorithm is 
a Delaunay triangulation. 


Proof. Suppose for contradiction that there is some triangle A € D and some point 
p € P strictly inside the circumcircle C of A. Among all such pairs (A, p), we choose one 
for which we the distance of p to A is minimal. Note that this distance is positive since 
D is a triangulation of P. The situation is as depicted in Figure 


(a) A point p inside the cir- (b) The edge e of A closest top (c) The circumcircle C’ of A’ also 
cumcircle C of a triangle A. and the second triangle A’ contains p, and p is closer to 
incident to e. A’ than to A. 


Figure 6.10: Correctness of the Lawson fitp algorithm. 


Now consider the edge e of A that is facing p. There must be another triangle A’ in 
D that is incident to the edge e. By the local Delaunay property of D, the third vertex q 
of A’ is on or outside of C, see Figure [6.10b] But then the circumcircle C’ of A’ contains 
the whole portion of C on p’s side of e, hence it also contains p; moreover, p is closer to 
A’ than to A (Figure [6.10c). But this is a contradiction to our choice of A and p. Hence 
there was no (A, p), and D is a Delaunay triangulation. O 


Exercise 6.14 The Euclidean minimum spanning tree (EMST') of a finite point set 
P C R? is a spanning tree for which the sum of the edge lengths is minimum 
(among all spanning trees of P). Show: 
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a) Every EMST of P is a plane graph. 


b) Every EMST of P contains a closest pair, i.e., an edge between two points 
p,q € P that have minimum distance to each other among all point pairs in (aja 


c) Every Delaunay Triangulation of P contains an EMST of P. 


6.5 The Delaunay Graph 


Despite the fact that a point set may have more than one Delaunay triangulation, there 
are certain edges that are present in every Delaunay triangulation, for instance, the edges 
of the convex hull. 


Definition 6.15 The Delaunay graph of P C R? consists of all line segments pq, for 
p, q € P, that are contained in every Delaunay triangulation of P. 


The following characterizes the edges of the Delaunay graph. 


Lemma 6.16 The segment pq, for p,q € P, ts in the Delaunay graph of P if and only 
if there exists a circle through p and q that has p and q on its boundary and all 
other points of P are strictly outside. 


Proof. “=”: Let pq be an edge in the Delaunay graph of P, and let D be a Delaunay 
triangulation of P. Then there exists a triangle A = pqr in D, whose circumcircle C does 
not contain any point from P in its interior. 

If there is a point s on OC such that Ts intersects pq, then let A’ = pqt denote the 
other (4 A) triangle in D that is incident to pq (Figure [6.11a). Flipping the edge pq 
to rt yields another Delaunay triangulation of P that does not contain the edge pq, in 
contradiction to pq being an edge in the Delaunay graph of P. Therefore, there is no 
such point s. 


(a) Another point s € ðC. (b) Moving C away from s. 


Figure 6.11: Characterization of edges in the Delaunay graph (1). 
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Otherwise we can slightly change the circle C by moving away from r while keeping 
p and q on the circle. As P is a finite point set, we can do such a modification without 
catching another point from P with the circle. In this way we obtain a circle C’ through 
p and q such that all other points from P are strictly outside C’ (Figure [6.12b). 

“2”: Let D be a Delaunay triangulation of P. If pq is not an edge of D, there must 
be another edge of D that crosses pq (otherwise, we could add pq to D and still have 
a plane graph, a contradiction to D being a triangulation of P). Let rs denote the first 
edge of D that intersects the directed line segment pq. 

Consider the triangle A of D that is incident to rs on the side that faces p (given 
that Ts intersects pq this is a well defined direction). By the choice of rs neither of the 
other two edges of A intersects pq, and p £ A” because A is part of a triangulation of P. 
The only remaining option is that p is a vertex of A = prs. As A is part of a Delaunay 
triangulation, its circumcircle C4 is empty (i.e., Ca NP =0). 

Consider now a circle C through p and q, which exists by assumption. Fixing p and q, 
expand C towards r to eventually obtain the circle C’ through p, q, and r (Figure|6.12a)). 
Recall that r and s are on different sides of the line through p and q. Therefore, s lies 
strictly outside of C’. Next fix p and r and expand C’ towards s to eventually obtain the 
circle Ca through p, r, and s (Figure [6.12b). Recall that s and q are on the same side 
of the line through p and r. Therefore, q € Ca, which is in contradiction to Ca being 
empty. It follows that there is no Delaunay triangulation of P that does not contain the 
edge pq. O 


A as 
(a) Expanding C towards r. (b) Expanding C’ towards s. 


Figure 6.12: Characterization of edges in the Delaunay graph (11). 


The Delaunay graph is useful to prove uniqueness of the Delaunay triangulation in 
case of general position. 


Corollary 6.17 Let P C R? be a finite set of points in general position, that is, no four 
points of P are cocircular. Then P has a unique Delaunay triangulation. O 
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6.6 Every Delaunay Triangulation Maximizes the Smallest Angle 


Why are we actually interested in Delaunay triangulations? After all, having empty 
circumcircles is not a goal in itself. But it turns out that Delaunay triangulations satisfy 
a number of interesting properties. Here we show just one of them. 

Recall that when we compared a scan triangulation with a Delaunay triangulation of 
the same point set in Figure|6.3| we claimed that the scan triangulation is “ugly” because 
it contains many long and skinny triangles. The triangles of the Delaunay triangulation, 
at least in this example, look much nicer, that is, much closer to an equilateral triangle. 
One way to quantify this “niceness” is to look at the angles that appear in a triangulation: 
If all angles are large, then all triangles are reasonably close to an equilateral triangle. 
Indeed, we will show that Delaunay triangulations maximize the smallest angle among 
all triangulations of a given point set. Note that this does not imply that there are no 
long and skinny triangles in a Delaunay triangulation. But if there is a long and skinny 
triangle in a Delaunay triangulation, then there is an at least as long and skinny triangle 
in every triangulation of the point set. 

Given a triangulation J of P, consider the sorted sequence A(T) = (1, X2,..., X3m) 
of interior angles, where m is the number of triangles (we have already remarked earlier 
that m is a function of P only and does not depend on J). Being sorted means that 
1 < A2 S -:: < Agm. Let T, T” be two triangulations of P. We say that A(T) < A(T’) 
if there exists some i for which a; < aq and aj = aj, for all j <i. (This is nothing but 
the lexicographic order on these sequences. ) 


Theorem 6.18 Let P C R? be a finite set of points in general position (not all collinear 
and no four cocircular). Let D* be the unique Delaunay triangulation of P, and let 
T be any triangulation of P. Then A(T) < A(D*). 


In particular, D* maximizes the smallest angle among all triangulations of P. 


(a) Four cocircular points and the (b) The situation before a flip. 
induced eight angles. 


Figure 6.13: Angle-optemality of Delaunay triangulations. 


76 


CG 2013 6.6. Every Delaunay Triangulation Maximizes the Smallest Angle 


Proof. We know that T can be transformed into D* through the Lawson flip algorithm, 
and we are done if we can show that each such flip lexicographically increases the sorted 
angle sequence. A flip replaces six interior angles by six other interior angles, and we 
will actually show that the smallest of the six angles strictly increases under the flip. 
This implies that the whole angle sequence increases lexicographically. 

Let us first look at the situation of four cocircular points, see Figure In this 
situation, the inscribed angle theorem (a generalization of Thales’ Theorem, stated 
below as Theorem [6.19) tells us that the eight depicted angles come in four equal pairs. 
For instance, the angles labeled a, at s and r are angles on the same side of the chord 
pq of the circle. 

In Figure we have the situation in which we perform a Lawson flip (replacing 
the solid with the dashed diagonal). By the symbol a (&, respectively) we denote an 
angle strictly smaller (larger, respectively) than a. Here are the six angles before the 
flip: 


X1 + A2, X3, X4, Xi, Az, X3 + X4. 
After the flip, we have 
Xi, A2, Az, Xy, A+ A4, Ag+ Xs. 


Now, for every angle after the flip there is at least one smaller angle before the flip: 


A > 0, 
A > Mz, 
a > Az, 
a > Xa, 
Qt, > OX, 
Ag +X, > Xz. 
It follows that the smallest angle strictly increases. O 


Theorem 6.19 (Inscribed Angle Theorem) Let C be a circle with center c and positive 
radius and p,q € C. Then the angle Zprq mod n = +Zpcq is the same, for allr € C. 


Proof. Without loss of generality we may assume that c is located to the left of or on 
the oriented line pq. 
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Figure 6.14: The Inscribed Angle Theorem with O := Zprq. 


Consider first the case that the triangle A = pqr 
contains c. Then A can be partitioned into three trian- 
gles: pcr, qcr, and cpq. All three triangles are isosce- 
les, because two sides of each form the radius of C. De- 
note x = Zprc, B = Zcrq, y = Zcpq, and ô = Zpcq 
(see the figure shown to the right). The angles we are 
interested in are 0 = Zprq = a+ B and ô, for which 
we have to show that 5 = 20. 

Indeed, the angle sum in A is n = 2(a+8+y) 
and the angle sum in the triangle cpq is 7 = ô + 2y. 
Combining both yields ô = 2(a + B) = 20. 


Next suppose that pqcr are in convex position and 
r is to the left of or on the oriented line pq. Without 
loss of generality let r be to the left of or on the oriented 
line qc. (The case that r lies to the right of or on the 
oriented line pc is symmetric.) Define a, f, y, 5 as 
above and observe that 0 = a— fs. Again have to show 
that 5 = 20. 

The angle sum in the triangle cpq is 7 = 5+ 2y 
and the angle sum in the triangle rpq is m= (a—f)+ 
aty+(y— fp) =2(a+y—f). Combining both yields 
5=nm—2y = 2(a—f) = 20. 
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It remains to consider the case that r is to the right of the 
oriented line pq. 

Consider the point r’ that is antipodal to r on C, and the 
quadrilateral Q = prqr’. We are interested in the angle q of 
Q at r. By Thales’ Theorem the inner angles of Q at p and q 
are both 7/2. Hence the angle sum of Q is 217 = 0+ + 27/2 
and so y =1T—0. 


O 


What happens in the case where the Delaunay triangulation is not unique? The 
following still holds. 


Theorem 6.20 Let P C R? be a finite set of points, not all on a line. Every Delaunay 
triangulation D of P maximizes the smallest angle among all triangulations J of P. 


Proof. Let D be some Delaunay triangulation of P. We infinitesimally perturb the points 
in P such that no four are on acommon circle anymore. Then the Delaunay triangulation 
becomes unique (Corollary [6.17). Starting from D, we keep applying Lawson flips until 
we reach the unique Delaunay triangulation D* of the perturbed point set. Now we 
examine this sequence of flips on the original unperturbed point set. All these flips must 
involve four cocircular points (only in the cocircular case, an infinitesimal perturbation 
can change “good” edges into “bad” edges that still need to be flipped). But as Figure|6.13] 
(a) easily implies, such a “degenerate” flip does not change the smallest of the six involved 
angles. It follows that D and D* have the same smallest angle, and since D* maximizes 
the smallest angle among all triangulations J (Theorem (6.18), so does D. O 


6.7 Constrained Triangulations 


Sometimes one would like to have a Delaunay triangulation, but certain edges are already 
prescribed, for example, a Delaunay triangulation of a simple polygon. Of course, one 
cannot expect to be able to get a proper Delaunay triangulation where all triangles satisfy 
the empty circle property. But it is possible to obtain some triangulation that comes as 
close as possible to a proper Delaunay triangulation, given that we are forced to include 
the edges in E. Such a triangulation is called a constrained Delaunay triangulation, a 
formal definition of which follows. 

Let P C R? be a finite point set and G = (P,E) a geometric graph with vertex set 
P (we consider the edges e € E as line segments). A triangulation J of P respects G if 
it contains all segments e € E. A triangulation J of P that respects G is said to be a 
constrained Delaunay triangulation of P with respect to G if the following holds for 
every triangle A of 7: 
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The circumcircle of A contains only points q € P in its interior that are not 
visible from the interior of A. A point q € P is uzszble from the interior of 
A if there exists a point p in the interior of A such that the line segment pq 
does not intersect any segment e € E. We can thus imagine the line segments 
of E as “blocking the view”. 


For illustration, consider the simple polygon and its constrained Delaunay triangula- 
tion shown in Figure The circumcircle of the shaded triangle A contains a whole 
other triangle in its interior. But these points cannot be seen from A”, because all 
possible connecting line segments intersect the blocking polygon edge e of A. 


Figure 6.15: Constrained Delaunay triangulation of a simple polygon. 


Theorem 6.21 For every finite point set P and every plane graph G = (P,E), there 
exists a constrained Delaunay triangulation of P with respect to G. 


Exercise 6.22 Prove Theorem|6.21| Also describe a polynomial algorithm to construct 
such a triangulation. 
Questions 
20. What ts a triangulation? Provide the definition and prove a basic property: every 
triangulation with the same set of vertices and the same outer face has the same 


number of triangles. 


21. What ts a triangulation of a point set? Give a precise definition. 
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22. Does every point set (not all points on a common line) have a triangulation? 
You may, for example, argue with the scan triangulation. 


23. What is a Delaunay triangulation of a set of points? Give a precise definition. 


24. What is the Delaunay graph of a point set? Give a precise definition and a 
characterization. 


25. How can you prove that every set of points (not all on a common line) has a 
Delaunay triangulation? You can for example sketch the Lawson flip algorithm 
and the Lifting Map, and use these to show the existence. 


26. When 1s the Delaunay triangulation of a point set unique? Show that general 
position is a sufficient condition. Is it also necessary? 


27. What can you say about the “quality” of a Delaunay triangulation? Prove 
that every Delaunay triangulation maximizes the smallest interior angle in the 
triangulation, among the set of all triangulations of the same point set. 
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Delaunay Triangulation: Incremental 
Construction 


In the last lecture, we have learned about the Lawson flip algorithm that computes a 
Delaunay triangulation of a given n-point set P C R? with O(n”) Lawson flips. One can 
actually implement this algorithm to run in O(n?) time, and there are point sets where 
it may take O(n?) flips. 

In this lecture, we will discuss a different algorithm. The final goal is to show that 
this algorithm can be implemented to run in O(nlogn) time; this lecture, however, is 
concerned only with the correctness of the algorithm. Throughout the lecture we assume 
that P is in general position (no 3 points on a line, no 4 points on a common circle), so 
that the Delaunay triangulation is unique (Corollary |6.17). There are techniques to deal 
with non-general position, but we don’t discuss them here. 


7.1 Incremental construction 


The idea is to build the Delaunay triangulation of P by inserting one point after another. 
We always maintain the Delaunay triangulation of the point set R inserted so far, and 
when the next point s comes along, we simply update the triangulation to the Delaunay 
triangulation of RU {s}. Let DI(R) denote the Delaunay triangulation of R C P. 

To avoid special cases, we enhance the point set P with three artificial points “far 
out”. The convex hull of the resulting point set is a triangle; later, we can simply remove 
the extra points and their incident edges to obtain DJ(P). The incremental algorithm 
starts off with the Delaunay triangulation of the three artificial points which consists 
of one big triangle enclosing all other points. (In our figures, we suppress the far-away 
points, since they are merely a technicality.) 

Now assume that we have already built DI(R), and we next insert s € P\R. Here is 
the outline of the update step. 


1. Find the triangle A = A(p,q,r) of DI(R) that contains s, and replace it with the 
three triangles resulting from connecting s with all three vertices p, q,r; see Figure 
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Figure 7.1: Inserting s into DI(R): Step 1 


We now have a triangulation J of R U {s}. 


2. Perform Lawson flips on J until DI(R U(s)) is obtained; see Figure [7.2] 


Figure 7.2: Inserting s into DI(R): Step 2 


How to organize the Lawson flips. The Lawson flips can be organized quite systematically, 
since we always know the candidates for “bad” edges that may still have to be flipped. 
Initially (after step 1), only the three edges of A can be bad, since these are the only 
edges for which an incident triangle has changed (by inserting s in Step 1). Each of 
the three new edges is good, since the 4 vertices of its two incident triangles are not in 
convex position. 

Now we have the following invariant (part (a) certainly holds in the first flip): 
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(a) In every flip, the convex quadrilateral Q in which the flip happens has exactly two 
edges incident to s, and the flip generates a new edge incident to s. 


(b) Only the two edges of Q that are not incident to s can become bad through the 
flip. 


We will prove part (b) in the next lemma. The invariant then follows since (b) entails 
(a) in the next flip. This means that we can maintain a queue of potentially bad edges 
that we process in turn. A good edge will simply be removed from the queue, and a bad 
edge will be flipped and replaced according to (b) with two new edges in the queue. In 
this way, we never flip edges incident to s; the next lemma proves that this is correct 
and at the same time establishes part (b) of the invariant. 


Lemma 7.1 Every edge incident to s that is created during the update is an edge of 
the Delaunay graph of RU{s} and thus an edge that will be in DI(RU(s)). It easily 
follows that edges incident to s will never become bad during the update step!!] 


Proof. Let us consider one of the first three new edges, Sp, say. Since the triangle 
A has a circumcircle C strictly containing only s (A is in DI(R)), we can shrink that 
circumcircle to a circle C’ through s and p with no interior points, see Figure (a). 
This proves that sp is in the Delaunay graph. If st is an edge created by a flip, a similar 
argument works. The flip destroys exactly one triangle A of DI(R). Its circumcircle C 
contains s only, and shrinking it yields an empty circle C’ through s and t. Thus, st is 
in the Delaunay graph also in this case. O 


(a) New edge sp incident (b) New edge st incident 
to s created in Step 1 to s created in Step 2 


Figure 7.3: Newly created edges incident to s are in the Delaunay graph 


‘If such an edge was bad, it could be flipped, but then it would be “gone forever” according to the 
lifting map interpretation from the previous lecture. 
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o 


Figure 7.4: The history graph: one triangle gets replaced by three triangles 


7.2 The History Graph 


What can we say about the performance of the incremental construction? Not much yet. 
First of all, we did not specify how we find the triangle A of DI(R) that contains the 
point s to be inserted. Doing this in the obvious way (checking all triangles) is not good, 
since already the find steps would then amount to O(n?) work throughout the whole 
algorithm. Here is a smarter method, based on the history graph. 


Definition 7.2 Given R C P (regarded as a sequence that reflects the insertion order), 
the history graph of R is a directed acyclic graph whose vertices are all triangles 
that have ever been created during the incremental construction of DJ(R). There 
is a directed edge from A to A’ whenever A has been destroyed during an insertion 
step, A’ has been created during the same insertion step, and A overlaps with A’ 
in its interior. 


It follows that the history graph contains triangles of outdegrees 3, 2 and 0. The ones of 
outdegree O are clearly the triangles of DI(R). 

The triangles of outdegree 3 are the ones that have been destroyed during Step 1 of 
an insertion. For each such triangle A, its three outneighbors are the three new triangles 
that have replaced it, see Figure 

The triangles of outdegree 2 are the ones that have been destroyed during Step 2 of 
an insertion. For each such triangle A, its two outneighbors are the two new triangles 
created during the flip that has destroyed A, see Figure 

The history graph can be built during the incremental construction at asymptotically 
no extra cost; but it may need extra space since it keeps all triangles ever created. Given 
the history graph, we can search for the triangle A of DJ(R) that contains s, as follows. 
We start from the big triangle spanned by the three far-away points; this one certainly 
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Figure 7.5: The history graph: two triangles get replaced by two triangles 


contains s. Then we follow a directed path in the history graph. If the current triangle 
still has outneighbors, we find the unique outneighbor containing s and continue the 
search with this neighbor. If the current triangle has no outneighbors anymore, it is in 
DI(R) and contains s—we are done. 


Types of triangles in the history graph. After each insertion of a point s, several triangles are 
created and added to the history graph. It is important to note that these triangles come 
in two types: Some of them are valid Delaunay triangles of RU{s}, and they survive to the 
next stage of the incremental construction. Other triangles are immediately destroyed 
by subsequent Lawson flips, because they are not Delaunay triangles of RU {s}. These 
“ephemeral" triangles will give us some headache (though not much) in the algorithm’s 
analysis in the next chapter. 

Note that, whenever a Lawson flip is performed, of the two triangles destroyed one 
of them is always a “valid" triangle from a previous iteration, and the other one is an 
“ephemeral" triangle that was created at this iteration. The ephemeral triangle is always 
the one that has s, the newly inserted point, as a vertex. 


71.3 The structural change 


Concerning the actual update (Steps 1 and 2), we can make the following 


Observation 7.3 Given DI(R) and the triangle A of DI(R) that contains s, we can 
build DJ(RU{s}) in time proportional to the degree of s in DJ(RU{s}), which is the 
number of triangles of DJ(RU{s}) containing s. 


Indeed, since every flip generates exactly one new triangle incident to s, the number 
of flips is the degree of s minus three. Step 1 of the update takes constant time, and 
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since also every flip can be implemented in constant time, the observation follows. 

In the next lecture, we will show that a clever insertion order guarantees that the 
search paths traversed in the history graph are short, and that the structural change (the 
number of new triangles) is small. This will then give us the O(n log n) algorithm. 


Exercise 7.4 For a sequence of n pairwise distinct numbers y;,...,Yn consider the se- 
quence of pairs (min{yi,..., yi}, max[y1,...,Yi)i=o,1,...n (min @ := +00, max) := —00). 
How often do these pairs change in expectation if the sequence is permuted ran- 
domly, each permutation appearing with the same probability? Determine the ez- 
pected value. 


Questions 


28. Describe the algorithm for the incremental construction of DI(P): how do we 
find the triangle containing the point s to be inserted into DI(R)? How do we 
transform DI(R) into DJ(RU{s})? How many steps does the latter transformation 
take, in terms of DI(R U {s})? 


29. What are the two types of triangles that the history graph contains? 
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The Configuration Space Framework 


In Section we have discussed the incremental construction of the Delaunay trian- 
gulation of a finite point set. In this lecture, we want to analyze the runtime of this 
algorithm if the insertion order is chosen unzformly at random among all insertion or- 
ders. We will do the analysis not directly for the problem of constructing the Delaunay 
triangulation but in a somewhat more abstract framework, with the goal of reusing the 
analysis for other problems. 

Throughout this lecture, we again assume general position: no three points on a line, 
no four on a circle. 


8.1 The Delaunay triangulation — an abstract view 


The incremental construction constructs and destroys triangles. In this section, we want 
to take a closer look at these triangles, and we want to understand exactly when a triangle 
is “there”. 


Lemma 8.1 Given three points p, q,r € R, the triangle Alp, q,r) with vertices p, q, Y 
is a triangle of DT(R) if and only if the circumctrcle of Alp, q,r) ts empty of points 
from R. 


Proof. The “only if” direction follows from the definition of a Delaunay triangulation 
(Definition [6.8). The “if” direction is a consequence of general position and Lemma|6.16} 
if the circumcircle C of A(p,q,r) is empty of points from R, then all the three edges 
pq, qr, pr are easily seen to be in the Delaunay graph of R. C being empty also implies 
that the triangle A(p, q,r) is empty, and hence it forms a triangle of DI(R). O 


Next we develop a somewhat more abstract view of DI(R). 


Definition 8.2 


(i) For all p,q,r € P, the triangle A = A(p,q,r) is called a configuration. The 
pomts p, q and r are called the defining elements of A. 


89 


Chapter 8. The Configuration Space Framework CG 2013 


(ii) A configuration A is in conflict with a point s € P if s is strictly inside the 
circumcircle of A. In this case, the pair (A,s) is called a conflict. 


(iti) A configuration A is called active w.r.t. RC P if (a) the defining elements of 
A are in R, and (b) if A is not in conflict with any element of R. 


According to this definition and Lemma DI(R) consists of exactly the configu- 
rations that are active w.r.t. R. Moreover, if we consider DJ(R) and DI(R U {s}) as sets 
of configurations, we can exactly say how these two sets differ. 

There are the configurations in DI(R) that are not in conflict with s. These config- 
urations are still in DI(R U {s}). The configurations of DI(R) that are in conflict with 
s will be removed when going from R to RU{s}. Finally, DJ(RU{s}) contains some new 
configurations, all of which must have s in their defining set. According to Lemma [8.1] 
it cannot happen that we get a new configuration without s in its defining set, as such 
a configuration would have been present in DI(R) already. 


8.2 Configuration Spaces 


Here is the abstract framework that generalizes the previous configuration view of the 
Delaunay triangulation. 


Definition 8.3 Let X (the ground set) and TI (the set of configurations) be finite sets. 
Furthermore, let 


DMT 2 


be a function that assigns to every configuration A a set of defining elements D(A). 
We assume that only a constant number of configurations have the same defining 
elements. Let 


KsIMT=.2* 


be a function that assigns to every configuration A a set of elements in conflict with 
A (the “killer” elements). We stipulate that D(A) NK(A) = 9 for all A € TT. 
Then the quadruple S = (X, TI, D,K) is called a configuration space. The number 


d= d(S) := max |D(A)| 


is called the dimension of 8. 
Given R C X, a configuration A is called active w.r.t. R af 


D(A) CR and K(AJNR=0, 


1.e. uf all defining elements are in R but no element of R is in conflict with A. The 
set of active configurations w.r.t. R is denoted by Js(R), where we drop the subscript 
uf the configuration space is clear from the contezt. 
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In case of the Delaunay triangulation, we set X = P (the input point set). TT consists 
of all triangles A = A(p, q, r) spanned by three points p, q,r € XU{a, b,c}, where a, b,c 
are the three artificial far-away points. We set D(A) := {p,q,r}NX. The set K(A) consists 
of all points strictly inside the circumcircle of A. The resulting configuration space has 
dimension 3, and the technical condition that only a constant number of configurations 
share the defining set is satisfied as well. In fact, every set of three points defines a 
unique configuration (triangle) in this case. A set of two points or one point defines 
three triangles (we have to add one or two artificial points which can be done in three 
ways). The empty set defines one triangle, the initial triangle consisting of just the three 
artificial points. 

Furthermore, in the setting of the Delaunay triangulation, a configuration is active 
w.r.t. Rif it is in DJ(RU {a, b,c}), i.e. we have J(R) = DI(RU(a, b, c}). 


8.3 Expected structural change 


Let us fix a configuration space 8 = (X, M, D,K) for the remainder of this lecture. We 
can also interpret the incremental construction in $. Given R C X and s € XA R, we 
want to update J(R) to J(RU{s}). What is the number of new configurations that arise 
during this step? For the case of Delaunay triangulations, this is the relevant question 
when we want to bound the number of Lawson flips during one update step, since this 
number is exactly the number of new configurations minus three. 

Here is the general picture. 


Definition 8.4 For Q C X ands € Q, deg(s, Q) is defined as the number of configura- 
tions of T(Q) that have s tn their defining set. 


With this, we can say that the number of new configurations in going from J(R) to 
T(RU(s)) is precisely deg(s, RU{s}), since the new configurations are by definition exactly 
the ones that have s in their defining set. 

Now the random insertion order comes in for the first time: what is 


E(deg(s, RU {s})), 


averaged over all insertion orders? In such a random insertion order, R is a random r- 
element subset of X (when we are about to insert the (r+1)-st element), and s isa random 
element of XA R. Let J, be the “random variable” for the set of active configurations 
after r insertion steps. 

It seems hard to average over all R, but there is a trick: we make a movie of the 
randomized incremental construction, and then we watch the movie backwards. What 
we see is elements of X being deleted one after another, again in random order. This is 
due to the fact that the reverse of a random order is also random. At the point where the 
(r+1)-st element is being deleted, it is going to be a random element s of the currently 
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present (r + 1)-element subset Q. For fixed Q, the expected degree of s is simply the 
average degree of an element in Q which is 


1 d 
aA 2 des(s, Q) < 10) 


since the sum counts every configuration of J(Q) at most d times. Since Q is a random 
(r + 1)-element subset, we get 


d 
E(deg(s, R een Ee 
(deg(s, RU (s})) < tres 


where t,,, is defined as the expected number of active configurations w.r.t. a random 
(r+ 1)-element set. 

Here is a more formal derivation that does not use the backwards movie view. It 
exploits the bijection 


(R,s) > (RU {s}, s) 
Q 


between pairs (R, s) with |R| = r and s ¢ R and pairs (Q,s) with |Q| =r+1 and s € Q. 
Let n = |X|. 


1 1 
E(deg(s, RU{s})) = -= —— Y deg(s,RU{s}) 
5) RCX,|R|=r Wer s€X\R 
= En ——— ) deg(s, ) 
ES) QCX,|Q|=r+1 seQ 
1 ( da) 1 
a = deg(s, Q) 
lea) oe (7) Er 2 
1 1 
= n deg(s, ) 
(04) oe. PIS 2 
e 
D R. QCX,IQ|=r+1 ul 
= r41 r+1 


Thus, the expected number of new configurations in going from J, to J,,, is bounded 
by 


ie, 
r+ 1 


tri) 
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where t, 1 is the expected size of T,,:. 
What do we get for Delaunay triangulations? We have d = 3 and t,,1 < 2(r+4)—4 
(the maximum number of triangles in a triangulation of r + 4 points). Hence, 


6 12 
Jie x6 


E(deg(s, RU {s})) < El 


This means that on average, ~ 3 Lawson flips are done to update DT, (the Delaunay 
triangulation after r insertion steps) to DT,,,. Over the whole algorithm, the expected 
update cost is thus O(n). 


8.4 Bounding location costs by conflict counting 


Before we can even update DI, to DI,,, during the incremental construction of the 
Delaunay triangulation, we need to locate the new point s in DJ, meaning that we need 
to find the triangle that contains s. We have done this with the history graph: During 
the insertion of s we “visit" a sequence of triangles from the history graph, each of which 
contains s and was created at some previous iteration k < r. 

However, some of these visited triangles are “ephemeral" triangles (recall the discus- 
sion at the end of Section [7.2), and they present a problem to the generic analysis we 
want to perform. Therefore, we will do a charging scheme, so that all triangles charged 
are valid Delaunay triangles. 

The charging scheme is as follows: If the visited triangle A is a valid Delaunay triangle 
(from some previous iteration), then we simply charge the visit of A during the insertion 
of s to the triangle-point pair (A, s). 

If, on the other hand, A is an “ephemeral" triangle, then A was destroyed, together 
with some neighbor A’, by a Lawson flip into another pair A”, A”. Note that this 
neighbor A’ was a valid triangle. Thus, in this case we charge the visit of A during the 
insertion of s to the pair (A’,s). Observe that s is contained in the circumcircle of A’, 
so s is in conflict with A’. 

This way, we have charged each visit to a triangle in the history graph to a triangle- 
point pair of the form (A, s), such that A is in conflict with s. Furthermore, it is easy to 
see that no such pair gets charged more than once. 

We define the notion of a conflict in general: 


Definition 8.5 A conflict is a configuration-element pair (A,s) where A € T, for some 
r and s € K(A). 


Thus, the running time of the Delaunay algorithm is proportional to the number of 
conflicts. We now proceed to derive a bound on the expected number of conflicts in the 
generic configuration-space framework. 
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8.5 Expected number of conflicts 


Since every configuration involved in a conflict has been created in some step r (we 
include step 0), the total number of conflicts is 


Y Y ka 


T=0 AET;\Tr-1 


where T_, := Ø. To consists of constantly many configurations only (namely those where 
the set of defining elements is the empty set), each of which is in conflict with at most 
all elements; moreover, no conflict is created in step n. Hence, 


Y Y Kial=0mM+Y Y KA), 


r=0 AETAT 1 r=1 AET,\T+-1 
and we will bound the latter quantity. Let 
K(r):= Y) [K(A), r1=1,...,n—1. 
Acetd+\Tr-1 


and k(r) := E(K(r)) the expected number of conflicts created in step r. 


Bounding k(r). We know that J, arises from a random r-element set R. Fixing R, the 
backwards movie view tells us that J,_, arises from J, by deleting a random element s 
of R. Thus, 


k(r) al D IS Y A 


RCX,|R|=r T SER AET(RIAT(RA(s)) 


A ER Z Ka 


REX [Rl=r sER AECT(R),seD(A) 


< al Y f5 Ka, 


7 
RCX,[R|=r AET(R) 


since in the sum over s € R, every configuration is counted at most d times. Since we 
can rewrite 


Y KA= Y HAeETR):y eKA), 


AET(R) yEXXR 
we thus have 
1 d 
kr) <- » =D) HMAET(R):y € K(A)). 
a RCX,|Rl=r yEX\R 


To estimate this further, here is a simple but crucial 
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Lemma 8.6 The configurations in T(R) that are not in conflict with y are the config- 
urations in T(RU([y)) that do not have y in their defining set; in formulas: 


¡T(R)|— KA € T(R) : y € K(A)} = [T(RU {y})] — deg(y, RU [y)). 


The proof is a direct consequence of the definitions: every configuration in J(R) not in 
conflict with y is by definition still present in T(R U {y}) and still does not have y in 
its defining set. And a configuration in J(R U {y} with y not in its defining set is by 
definition already present in J(R) and already there not in conflict with y. 


The lemma implies that 
k(r) < ki(r) — ka(r) + ka(r), 


where 


kin) = Tay Y E Y aegty, RUI). 


RCX,IRI=r  yeX\R 


Estimating k,(r). This is really simple. 


k, (r) 


REX, |R]=r yEXXR 


a) y iS 
1 
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Estimating k2(r). For this, we need to employ our earlier (R,y) + (RU {y}, y) bijection 
again. 


ko(r) = 


Y EY RU 


1 

(© 
TY RCX,|R|=r yYEX\R 
1 


= Y “Y OQ)! 


Qcx,|Ql=r+1  yeQ 


E! EA 
a Gee a 


r+1/ QCX,|Q|=r+1 (+) 


=o YE mamo 


(ta) gexa 


d 
a —=(n —=3)tr+ 
7 
d n 
= — 1))t.. ——t, 
ap (r+ 1)) TS y +1 
dn 
= ki(r+1)4 tka 
i(T ) r(r+1) +1 


Estimating k3(r). This is similar to k2(r) and in addition uses a fact that we have em- 
ployed before: 2 yego deg(y, Q) < dlT(Q)I. 


kr) = ar Y Y deglu, RUNY) 


(7) RCX,|Rl=r yeX\R 


= æ 1 50 
1 


QcxQl=r+1 yeQ 


> y ona 

le QCX,|Q|=r+1 

ta 
( k ) QCX,IQ|=r+1 C) 


E 2 
y HO 


1 
(a) QCX,|Q\|=r+1 


IN 
| 


a) q? 
ZIQ) 


d2 
= rr L 1) (n i Tita 
dn 4 
= t trait. 
r(r 1) T+1 r 1 r+1 
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Summing up. Let us recapitulate: the overall expected number of conflicts is O(n) plus 


n-1 n-1 
k(r) = (k, (1) — k2(r) + ka(r)) 
r=1 T=1 
Using our previous estimates, k,(2),...,k,(n — 1) are canceled by the first terms of 


k2(1),...,ke(n — 2). The second term of k2(r) can be combined with the first term of 
k3(r), so that we get 


n-1 n-—1 d(d— 1) n—1 a? 
AMN — ka(r) + ks(r)) < kı(1) = 2 TF1) tr+1 2 aia tet 
= trai 
< d(n—1)tz+d(d—1)n ie. 


en 


T= 


I 


O [ny 5) 


r=1 


The Delaunay case. We have argued that the expected number of conflicts asymptotically 
bounds the expected total location cost over all insertion steps. The previous equation 
tells us that this cost is proportional to O(n) plus 


O (m ata) =0 (ež :) = O(n logn). 


r=1 


is the n-th Harmonic Number which is known to be approximately lnn. 

By going through the abstract framework of configuration spaces, we have thus ana- 
lyzed the randomized incremental construction of the Delaunay triangulation of n points. 
According to Section the expected update cost itself is only O(n). The steps dom- 
inating the runtime are the location steps via the history graph. According to Section 
all history graph searches (whose number is proportional to the number of conflicts) 
can be performed in expected time O(nlogn), and this then also bounds the space 
requirements of the algorithm. 


Exercise 8.7 Design and analyze a sorting algorithm based on randomized incremen- 


tal construction in configuration spaces. The input 1s a set S of numbers, and the 
output should be the sorted sequence (in increasing order). 
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a) Define an appropriate configuration space for the problem! In particular, the 


set of active configurations w.r.t. S should represent the desired sorted se- 
quence. 


b) Provide an efficient implementation of the incremental construction algo- 


rithm. “Efficient” means that the runtime of the algorithm 15 asymptotically 
dominated by the number of conflicts. 


c) What is the expected number of conflicts (and thus the asymptotic runtime of 


your sorting algorithm) for a set S of n numbers? 


Questions 


30. 


31. 


32. 


33. 


34. 


What is a configuration space? Give a precise definition! What is an active 
configuration? 


How do we get a configuration space from the problem of computing the De- 
launay triangulation of a finite point set? 


How many new active configurations do we get on average when inserting the 
r-th element? Provide an answer for configuration spaces in general, and for the 
special case of the Delaunay triangulation. 


What is a conflict? Provide an answer for configuration spaces in general, and 
for the special case of the Delaunay triangulation. 


Explain why counting the expected number of conflicts asymptotically bounds 
the cost for the history searches during the randomized incremental construc- 
tion of the Delaunay triangulation! 


98 


Chapter 9 


Voronoi Diagrams 


9.1 Post Office Problem 


Suppose there are n post offices pi,...Pn in a city. Someone who is located at a position 
q within the city would like to know which post office is closest to him. Modeling the 
city as a planar region, we think of p;,,...p, and q as points in the plane. Denote the 
set of post offices by P = {pi,... pn}. 


Figure 9.1: Closest post offices for various query points. 


While the locations of post offices are known and do not change so frequently, we do 
not know in advance for which—possibly many—query locations the closest post office 
is to be found. Therefore, our long term goal is to come up with a data structure on 
top of P that allows to answer any possible query efficiently. The basic idea is to apply 
a so-called locus approach: we partition the query space into regions on which is the 
answer is the same. In our case, this amounts to partition the plane into regions such 
that for all points within a region the same point from P is closest (among all points 
from P). 

As a warmup, consider the problem for two post offices pi, pj € P. For which query 
locations is the answer p; rather than p;? This region is bounded by the bisector of pi 
and p;, that is, the set of points which have the same distance to both points. 


99 


Chapter 9. Voronoi Diagrams CG 2013 


Proposition 9.1 For any two distinct points in Rt the bisector is a hyperplane, that 
is, in R? it is a line. 


Proof. Let p = (p1,...,Ppa) and q = (qi,-.-,qa) be two points in Rt. The bisector of 
p and q consists of those points x = (x1, ..., Xa) for which 


2 


lip — xll = Ilq—xll <= lp — x? =I1q—x1? <=> Itpl? al = 2(p —q) "x. 


As p and q are distinct, this is the equation of a hyperplane. O 


Pj 


PiS 


H(pi, pj) 


Figure 9.2: The bisector of two points. 


Denote by H(pi, p;) the closed halfspace bounded by the bisector of p; and p; that 
contains p;. In R?, the region H(p;, p;) is a halfplane; see Figure [9.2] 


Exercise 9.2 


a) What is the bisector of a line l and a point p € R? \ £, that is, the set of all 
points x € R? with ||x — pl| = |x — €I| (= mingee lix — qll)? 


b) For two points p 4 q € R*, what is the region that contains all points whose 
distance to p ts exactly twice their distance to q? 
9.2 Voronoi Diagram 
In the following we work with a set P = (pi, ..., pn} of points in R?. 
Definition 9.3 (Voronoi cell) For pı € P denote the Voronoi cell Vp(i) of pi by 
Vp(i) = {q € R? | llq — pill < liq —pll for all p € P}. 
Proposition 9.4 


Vp(1) =( )H(pi, p3) - 
A 


Proof. For j #i we have |lq — pill < llq — p;ll <= q € H(pi, py). O 
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Corollary 9.5 Vp(i) is non-empty and convez. 


Proof. According to Proposition the region Vp(i) is the intersection of a finite 
number of halfplanes and hence convex. As pi € Vp(i), we have Vp(i) £ 0. O 


Observe that every point of the plane lies in some Voronoi cell but no point lies in the 
interior of two Voronoi cells. Therefore these cells form a subdivision of the plane. See 
Figure [9.3] for an example. 


Definition 9.6 (Voronoi Diagram) The Voronoi Diagram VD(P) of a set P = {pi, ..., Pn} of 
points in R? is the subdivision of the plane induced by the Voronoi cells Vp(i), for 
GH 1, cen gy Me 

Denote by VV(P) the set of vertices, by VE(P) the set of edges, and by VR(P) the 
set of regions (faces) of VD(P). 


Figure 9.3: Example: The Voronoi diagram of a point set. 


Lemma 9.7 For every vertex v € VV(P) the following statements hold. 
a) v is the common intersection of at least three edges from VE(P); 
b) v is incident to at least three regions from VR(P); 
c) v is the center of a circle C(v) through at least three points from P such that 
d) Civ) n P=. 


Proof. Consider a vertex v € VV(P). As all Voronoi cells are convex, k > 3 of them 
must be incident to v. This proves Part a) and b). 
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Without loss of generality let these cells be Vp(i), for 
1<i<k. Denote by ei, 1 < i < k, the edge incident to v 
that bounds Vp(i) and Vp((imod k) + 1). 

For any i=1,...,k we have v € e; > |lv— pill = |lv— 
P(imoak)+il|. In other words, pi, P2, .-., Px are cocircular, 
which proves Part c). 

Part d): Suppose there exists a point pe € C(v)°. Then 


the vertex v is closer to pe than it is to any of p1,...,Px, 
in contradiction to the fact that v is contained in all of 
Vp(1),..., Vp(k). O 


Corollary 9.8 If P is in general position (no four points from P are cocircular), then 
for every vertex v € VV(P) the following statements hold. 


a) v is the common intersection of exactly three edges from VE(P); 

b) v is incident to exactly three regions from VR(P); 

c) v is the center of a circle C(v) through ezactly three points from P such that 
d) Cy) NA P=0: O 


Lemma 9.9 There is an unbounded Voronoi edge bounding Vp(i) and Vp(j) => 
Pipi NP = {pi, pj} and pip; C Oconv(P), where the latter denotes the boundary of the 
conver hull of P. 


Proof. 
Denote by bi; the bisector of p; and pj, and let € oe 
denote the family of circles centered at some point we SA 
on bij and passing through p; (and pj). There is ae p sD 
an unbounded Voronoi edge bounding Vp(i) and z i 
Vp(j) => there is a ray p C bij such that : gee | Be 
llr — pxll > Ilr — pill (= Ilr — pjll), for every r € p : Y |e 
and every pk € P with k ¢ {i,j}. Equivalently, ES 
there is a ray p C bij such that for every point a : a $70 
r E p the circle C € € centered at r does not a | i 
contain any point from P in its interior. ae Ee 
The latter statement implies that the open Pi oaa PI 
halfplane H, whose bounding line passes through la dd 
pi and p; and such that H contains the infinite bij 
part of p, contains no point from P in its interior. 
Therefore, Pip; appears on dconv(P) and pip; does not contain any px € P, for k Æ i,j. 
Conversely, suppose that pipj appears on dconv(P) and pip; N P = {pi, pj}. Then 
some halfplane H whose bounding line passes through p; and p; contains no point from 
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P in its interior. In particular, the existence of H together with pip; MP = {pi, p;) implies 
that there is some circle C € € such that C N P = {pi, pj}. Denote by ro the center of 
C and let p denote the ray starting from To along b;,; such that the infinite part of p is 
contained in H. Consider any circle D € © centered at a point r € p and observe that 
D\HCC\H. As neither H nor C contain any point from P in their respective interior, 
neither does D. This holds for every D, and we have seen above that this statement is 
equivalent to the existence of an unbounded Voronoi edge bounding Vp(i) and Vp(j). O 


9.3 Duality 


A straight-line dual of a plane graph G is a graph G’ defined as follows: Choose a point 
for each face of G and connect any two such points by a straight edge, if the corresponding 
faces share an edge of G. Observe that this notion depends on the embedding; that 
is why the straight-line dual is defined for a plane graph rather than for an abstract 
graph. In general, G’ may have edge crossings, which may also depend on the choice of 
representative points within the faces. However, for Voronoi diagrams is a particularly 
natural choice of representative points such that G’ is plane: the points from P. 


Theorem 9.10 (Delaunay [2]) The straight-line dual of VD(P) for a set P Cc R? ofn >3 
points in general position (no three points from P are collinear and no four points 
from P are cocircular) is a triangulation: the unique Delaunay triangulation of P. 


Proof. By Lemma|9.9} the convex hull edges appear in the straight-line dual T of VD(P) 
and they correspond exactly to the unbounded edges of VD(P). All remaining edges 
of VD(P) are bounded, that is, both endpoints are Voronoi vertices. Consider some 
v € VV(P). According to Corollary [9.8[b), v is incident to exactly three Voronoi regions, 
which, therefore, form a triangle A(v) in T. By Corollary [9.8[d), the circumcircle of A(v) 
does not contain any point from P in its interior. Hence A(v) appears in the (unique by 
Corollary |6.17) Delaunay triangulation of P. 

Conversely, for any triangle pip;px in the Delaunay triangulation of P, by the empty 
circle property the circumcenter c of pipjpx has pi, pj, and px as its closest points from 
P. Therefore, c € VV(P) and—as above—the triangle pip;px appears in T. O 


It is not hard to generalize Theorem to general point sets. In this case, a 
Voronoi vertex of degree k is mapped to a convex polygon with k cocircular vertices. 
Any triangulation of such a polygon yields a Delaunay triangulation of the point set. 


Corollary 9.11 |VE(P)| < 3n—6 and |VV(P)| < 2n—5. 


Proof. Every edge in VE(P) corresponds to an edge in the dual Delaunay triangulation. 
The latter is a plane graph on n vertices and thus has at most 3n — 6 edges and at most 
2n—4 faces by Lemma|4.1] Only the bounded faces correspond to a vertex in VD(P). Ol 
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Figure 9.4: The Voronoi diagram of a point set and its dual Delaunay triangulation. 


Corollary 9.12 For a set P C R? of n points, the Voronoi diagram of P can be con- 
structed in expected O(nlogn) time and O(n) space. 


Proof. We have seen that a Delaunay triangulation T for P can be obtained using 
randomized incremental construction in the given time and space bounds. As T isa 
plane graph, its number of vertices, edges, and faces all are linear in n. Therefore, the 
straight-line dual of T—which by Theorem [9.10] is the desired Voronoi diagram—can be 
computed in O(n) additional time and space. O 


Exercise 9.13 Consider the Delaunay triangulation T for a set P C R? ofn > 3 points 
in general position. Prove or disprove: 


a) Every edge of T intersects its dual Voronoi edge. 


b) Every verter of VD(P) is contained in its dual Delaunay triangle. 


9.4 Lifting Map 


Recall the lifting map that we used in Section|6.3|to prove that the Lawson Flip Algorithm 
terminates. Denote by U : z = x? + y? the unit paraboloid in R°. The lifting map 
l: R? > U with £: p = (px, Py) (Px, Py» Px? + Py”) is the projection of the x/y-plane 
onto U in direction of the z-axis. 

For p € R? let H, denote the plane of tangency to U in £(p). Denote by hp : R? > Hp 
the projection of the x/y-plane onto H, in direction of the z-axis (see Figure [9.5). 


Lemma 9.14 ||€(q) — h, (q)ll = Ilp — ql, for any points p,q € R?. 
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Figure 9.5: Lifting map interpretation of the Voronoi diagram in a two-dimensional 
projection. 


Exercise 9.15 Prove Lemma Hint: First determine the equation of the tangent 
plane H, to U in L(p). 


Theorem 9.16 For p = (px,py) € R? denote by H, the plane of tangency to the unit 
paraboloid U = {(x,y,z) : 2=x*+yYy*) C R? in L(p) = (px, Py, Px? + Py”). Let H(P) := 
(pep H$ the intersection of all halfspaces above the planes H,, for p € P. Then the 
vertical projection of OH(P) onto the x/y-plane forms the Voronoi Diagram of P 
(the faces of OH(P) correspond to Voronoi regions, the edges to Voronoi edges, and 
the vertices to Voronoi vertices). 


Proof. For any point q € R?, the vertical line through q intersects every plane Hy, 
p € P. By Lemma the topmost plane intersected belongs to the point from P that 
is closest to q. O 


9.5 Point location in a Voronoi Diagram 
One last bit is still missing in order to solve the post office problem optimally. 


Theorem 9.17 Given a triangulation T for a set P C R? ofn points, one can build in 
O(n) time an O(n) size data structure that allows for any query point q € conv(P) 
to find in O(logn) time a triangle from T containing q. 


The data structure we will employ is known as Kirkpatrick’s hierarchy. But before 
discussing it in detail, let us put things together in terms of the post office problem. 


Corollary 9.18 (Nearest Neighbor Search) Given a set P C R? of n points, one can build 
in expected O(nlogn) time an O(n) size data structure that allows for any query 
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point q € conv(P) to find in O(logn) time a nearest neighbor of q among the points 
from. P. 


Proof. First construct the Voronoi Diagram V of P in expected O(nlog n) time. It has 
exactly n convex faces. Every unbounded face can be cut by the convex hull boundary 
into a bounded and an unbounded part. As we are concerned with query points within 
conv(P) only, we can restrict our attention to the bounded parts['| Any convex polygon 
can easily be triangulated in time linear in its number of edges (= number of vertices). 
As V has at most 3n — 6 edges and every edge appears in exactly two faces, V can 
be triangulated in O(n) time overall. Label each of the resulting triangles with the 
point from p, whose Voronoi region contains it, and apply the data structure from 
Theorem [9.17 O 


9.5.1 Kirkpatrick's Hierarchy 


We will now the develop the data structure for point location in a triangulation, as 
described in T'heorem The main idea is to construct a hierarchy To,..., In of trian- 
gulations, such that 


e To T, 
e the vertices of T; are a subset of the vertices of T;_,, fori=1,...,h, and 


e T, comprises a single triangle only. 


Search. For a query point x the triangle from T containing x can be found as follows. 
Search(x € R?) 
1. Fori=h,h-—1,...,0: Find a triangle t; from T; that contains x. 


2. return to. 
This search is efficient under the following conditions. 


(C1) Every triangle from Ti intersects only few (< c) triangles from Ti—1. (These will 
then be connected via the data structure. ) 


(C2) his small (< dlogn). 


Proposition 9.19 The search procedure described above needs < 3cdlogn = O(logn) 
orientation tests. 


Proof. For every Ti, 0 <i< h, at most c triangles are tested as to whether or not they 
contain x. Using three orientation tests one can determine whether or not a triangle 
contains a given point. O 


We even know how to decide in O(log n) time whether or not a given point lies within conv(P), see 


Exercise [8.22] 
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Thinning. Removing a vertex v and all its incident edges from a triangulation creates a 
non-triangulated hole that forms a star-shaped polygon since all points are visible from 
v (the star-point). 


Lemma 9.20 A star-shaped polygon, given as a sequence of n > 3 vertices and a 
star-point, can be triangulated in O(n) time. 


Proof. For n = 3 there is nothing to do. For n > 3, consider a star-shaped polygon 
P = (po,---,Pn—1) and a star-point s of P. Consider some convex vertex p; of P, that is, 
ZPis1PiPi_1 < Tr, all indices taken modn. (Every simple polygon on n > 3 vertices has 
at least three convex vertices, on the convex hull.) 

As P is star-shaped, the quadrilateral Q = pi_ipipi+is is completely contained in 
P. Therefore, if 7pi_ispizi < TT and hence Q is convex (Figure [9.6a), then we can add 
Pi-1Pi+1 as a diagonal. In this way one triangle is cut off from P, leaving a star-shaped 
polygon P’ (with respect to s) on n—1 vertices. The polygon P’ can then be triangulated 
recursively. If, on the other hand, Zp;_ispisi1 > 7 (Figure [9.6b)), we cannot safely add 


a a Pi+1 pi 


(a) Lpi-15Piy1 < 7. (b) Zpi-1Spi+1 > T. 


Figure 9.6: The quadrilateral pi_ipipi+is 1s contained in P. 


the edge pi_ipii1 because it might intersect other edges of P or even lie outside of P. 
But we claim that in this case there exists another convex vertex pj of P, for which 
Zpj—1SPj+1 < 7 and therefore we can add the edge pj_1pj+1 instead. 

In fact, it is enough to choose p; to be some convex vertex of P that is not a neighbor 
of pi: As Ta LPx-18Ppx = 27 and Zpi_ispi + 4PiSPir1 = ZPi_iSPi+1 > 7, we have 
LPj-1SPj + ZPjSPj41 = LPj-1SPj41 < T. 

It remains to show that such a vertex pj exists. P has at least three convex vertices. 
One of them is pi. If the only other two convex vertices are pi—ı and pi+1, then we make 
the whole argument with p;_; instead of p; and find j =i+ 1. Note that pi—ı and pi, 
are not neighbors because P is not a triangle. 

As for the linear time bound, we simply scan the sequence of vertices as in Graham’s 
Scan. For every triple pi_ipipi+1 of successive vertices it is tested (in constant time) 
whether pi_ipipi+is forms a convex quadrilateral. If so, then the edge pi_ipi4+i is added, 
effectively removing p; from further consideration. Therefore, p; can be charged for the 
(potential, if there are enough vertices left) additional test of the new triple p,pi_1pi+ 
formed with the predecessor p, of pj_; in the current sequence. As shown for Graham’s 
Scan in Theorem |3.25] this results in a linear time algorithm overall P| O 


me 


2Recall that the O(nlog n) time bound for Graham's Scan was caused by the initial sorting only. 
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As a side remark, the kernel of a simple polygon, that is, the set of all star-points, 
can be constructed in linear time as well. We will get back to this question later in the 
course... 

Our working plan is to obtain T; from T;_; by removing several independent (pairwise 
non-adjacent) vertices and re-triangulating. These vertices should 


a) have small degree (otherwise the degree within the hierarchy gets too large, that 
is, we need to test too many triangles on the next level) and 


b) be many (otherwise the height h of the hierarchy gets too large). 


The following lemma asserts the existence of a sufficiently large set of independent 
small-degree vertices in every triangulation. 


Lemma 9.21 In every triangulation of n points in R? there exists an independent set 
of at least |[n/18| vertices of maximum degree 8. Moreover, such a set can be found 
in O(n) time. 


Proof. Let T = (V, E) denote the graph of the triangulation, which we consider as an 
abstract graph in the following. We may suppose that T is maximally planar, that is, all 
faces are triangles. (Otherwise triangulate the exterior face arbitrarily. An independent 
set in the resulting graph is also independent in T.) For n = 3 the statement is true. Let 
n > 4. 

By the Euler formula we have |E| = 3n — 6, that is, 


Y degr(v) = 2|E| = 6n — 12 < 6n. 
vEV 


Let W C V denote the set of vertices of degree at most 8. Claim: |W| > n/2. Suppose 
|W| < n/2. As every vertex has degree at least three, we have 


Y degr(v) = Y) degr(v+ J  degr(v) > 3[W|+9/VA WI 


veV veW vEVXW 
3|/W]| + 9(n — |W|) = 9n — 6|W] > 9n — 3n = 6n, 


I 


in contradiction to the above. 

Construct an independent set U in T as follows (greedily): As long as W # f, add an 
arbitrary vertex v € W to U and remove v and all its neighbors from W. 

Obviously U is independent and all vertices in U have degree at most 8. At each 
selection step at most 9 vertices are removed from W. Therefore |U| > [(n/2)/9] = 
[n/18]. O 


Proof. (of Theorem [9.17) 
Construct the hierarchy To,... Tn with To = T as follows. Obtain Ti from T; 1 by removing 
an independent set U as in Lemma and re-triangulating the resulting holes. By 
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Lemma and Lemma every step is linear in the number |T;| of vertices in Ti. 
The total cost for building the data structure is thus 


h h 
Y alTil< S aní 17/18)' el 17/18)? = 18an € O(n), 
i=0 i=0 


i=0 


for some constant ax. Similarly the space consumption is linear. 
The number of levels amounts to h = log;s,1,n < 12.2logn. Thus by Proposi- 
tion the search needs at most 3 - 8 - logig/17n < 292 log n orientation tests. O 


Improvements. As the name suggests, the hierarchical approach discussed above is due 
to David Kirkpatrick [5]. The constant 292 that appears in the search time is somewhat 
large. There has been a whole line of research trying to improve it using different 
techniques. 


e Sarnak and Tarjan [6]: 4logn. 
e Edelsbrunner, Guibas, and Stolfi [3]: 3logn. 


e Goodrich, Orletsky, and Ramaiyer [4]: 2logn. 
e Adamy and Seidel [I]: 1logn + 2y1og n + O( Vlog n). 


Exercise 9.22 Let {pi, p2,...,Pn} be a set of points in the plane, which we call obsta- 
cles. Imagine there 1s a disk of radius r centered at the origin which can be moved 
around the obstacles but 1s not allowed to intersect them (touching the boundary is 
ok). Is it possible to move the disk out of these obstacles? See the example depicted 
in Figure below. 

More formally, the question 1s whether there 1s a (continuous) path y : [0,1] —> 
R? with y(0) = (0,0) and ||y(1)|| > max{||pill,..-, pnl}, such that at any time t € 
[0,1] and |y(t) — pill > r, for any 1 <S i < n. Describe an algorithm to decide 
this question and to construct such a path—if one exists—given arbitrary points 
{P1 P2,- -- Pn} and a radius r >0. Argue why your algorithm is correct and analyze 
its running time. 


Exercise 9.23 This exercise 1s about an application from Computational Biology: 

You are given a set of disks P = {a1, .., an} in R?, all with the same radius ra > 0. 
Each of these disks represents an atom of a protein. A water molecule 1s represented 
by a disc with radius Ty > Ta. A water molecule cannot intersect the interior of 
any protein atom, but it can be tangent to one. We say that an atom a; € P îs 
accessible 2f there exists a placement of a water molecule such that it is tangent to 
a; and does not intersect the interior of any other atom in P. Given P, find an 
O(nlogn) time algorithm which determines all atoms of P that are inaccessible. 
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Figure 9.7: Motion planning: Illustration for Exercise [9.22] 


Exercise 9.24 Let P C R? be a set of n points. Describe a data structure to find in 
O(logn) time a point in P that is furthest from a given query point q among all 
points in P. 


Exercise 9.25 Show that the bounds given in Theorem |9.1] are optimal in the alge- 
braic computation tree model. 


Questions 


35. 


36. 


37. 


38. 


39. 


What is the Voronoi diagram of a set of points in R?? Give a precise definition 
and explain/prove the basic properties: convexity of cells, why is it a subdivision 


of the plane?, Lemma Lemma [9.9] 


What is the correspondence between the Voronoi diagram and the Delaunay 
triangulation for a set of points in R?? Prove duality (Theorem|9.10) and explain 
where general position is needed. 


How to construct the Voronoi diagram of a set of points in R?? Describe an 
O(nlogn) time algorithm, for instance, via Delaunay triangulation. 


How can the Voronoi diagram be interpreted in context of the lifting map? 
Describe the transformation and prove its properties to obtain a formulation of the 
Voronoi diagram as an intersection of halfspaces one dimension higher. 


What is the Post-Office Problem and how can it be solved optimally? De- 
scribe the problem and a solution using linear space, O(n log n) preprocessing, and 
O(log n) query time. 
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40. How does Kirkpatrick’s hierarchical data structure for planar point location 
work exactly? Describe how to build it and how the search works, and prove the 
runtime bounds. In particular, you should be able to state and prove Lemma [9.20] 


Lemma [9.21) and Theorem [9.17] 
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Chapter 10 


Trapezoidal Maps 


In this section, we will see another application of randomized incremental construction 
in the abstract configuration space framework. At the same time, this will give us an 
efficient algorithm for solving the general problem of point location, as well as a faster 
algorithm for computing all intersections between a given set of line segments. 


10.1 The Trapezoidal Map 


To start with, let us introduce the concept of a trapezordal map. 

We are given a set S = {s1,..., Sn} of line segments in the plane (not necessarily 
disjoint). We make several general position assumptions. 

We assume that no two segment endpoints and intersection points have the same 
x-coordinate. As an exception, we do allow several segments to share an endpoint. We 
also assume that no line segment is vertical, that any two line segments intersect in at 
most one point (which is a common endpoint, or a proper crossing), and that no three 
line segments have a common point. Finally, we assume that s; C [0, 1]? for all i (which 
can be achieved by scaling the coordinates of the segments accordingly). 


Definition 10.1 The trapezoidal map of S is the partition of [0,1]? into vertices, edges, 
and faces (called trapezoids), obtained as follows. Every segment endpoint and point 
of intersection between segments gets connected by two vertical extensions with the 
next feature below and above, where a feature is either another line segment or an 
edge of the bounding boz (0, 1]?. 


Figure gives an example. 

(The general-position assumptions are made only for convenience and simplicity of 
the presentation. The various degeneracies can be handled without too much trouble, 
though we will not get into the details.) 

The trapezoids of the trapezoidal map are “true” trapezoids (quadrangles with two 
parallel vertical sides), and triangles (which may be considered as degenerate trapezoids). 
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Figure 10.1: The trapezoidal map of five line segments (depicted in bold) 


10.2 Applications of trapezoidal maps 


In this chapter we will see two applications of trapezoidal maps (there are others): 


1. Point location: Given a set of n segments in the plane, we want to preprocess 
them in order to answer point-location queries: given a point p, return the cell 
(connected component of the complement of the segments) that contains p (see 
Figure [10.2). This is a more powerful alternative to Kirkpatrick’s algorithm that 
handles only triangulations, and which is treated in Section [9.5] The preprocessing 
constructs the trapezoidal map of the segments (Figure in expected time 
O(nlogn-+K), where K is the number of intersections between the input segments; 
the query time will be O(logn) in expectation. 


2. Line segment intersection: Given a set of n segments, we will report all segment 
intersections in expected time O(nlogn + K). This is a faster alternative to the 
line-sweep algorithm we saw in Section which takes time O((n + K) log n). 


10.3 Incremental Construction of the Trapezoidal Map 


We can construct the trapezoidal map by inserting the segments one by one, in random 
order, always maintaining the trapezoidal map of the segments inserted so far. In order 
to perform manipulations efficiently, we can represent the current trapezoidal map as a 
doubly-connected edge list (see Section [4.2), for example. 
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Figure 10.2: The general point location problem defined by a set of (possibly inter- 
secting) line segments. In this erample, the segments partition the plane 
into 5 cells. 


Suppose that we have already inserted segments s,,...,S;—1, and that the resulting 
trapezoidal map J,_; looks like in Figure Now we insert segment s, (see Fig- 


ure [10.4). 


Here are the four steps that we need to do in order to construct J,. 
1. Find the trapezoid [ly of J,_; that contains the left endpoint of s+. 


2. Trace s, through 7,_, until the trapezoid containing the right endpoint of s, is 
found. To get from the current trapezoid LJ to the next one, traverse the boundary 
of O until the edge is found through which s, leaves Ll. 


3. Split the trapezoids intersected by s,. A trapezoid Ll may get replaced by 


e two new trapezoids (if s, intersects two vertical extensions of O); 
e three new trapezoids (if s, intersects one vertical extension of O); 


e four new trapezoids (if s, intersects no vertical extension of O). 


4. Merge trapezoids by removing parts of vertical extensions that do not belong to J, 
anymore. 


Figure illustrates the Trace and Split steps. sg intersects 5 trapezoids, and they 
are being split into 3, 3, 4,3, and 3 trapezoids, respectively. 
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Figure 10.3: The trapezoidal map is a refinement of the partition of the plane into 
cells. For example, cell 3 1s a union of five trapezoids. 


The Merge step is shown in Figure In the example, there are two vertical 
edges that need to be removed (indicated with a cross) because they come from vertical 
extensions that are cut off by the new segment. In both cases, the two trapezoids to the 
left and right of the removed edge are being merged into one trapezoid (drawn shaded). 


10.4 Using trapezoidal maps for point location 


Recall that in the point location problem we want to preprocess a given set S of seg- 
ments in order to answer subsequent point-location queries: S partitions the plane into 
connected cells and we want to know, given a query point q, to which cell q belongs, 
see Figure 


Note that the trapezoidal map of S is a refinement of the partition of the plane into 
cells, in the sense that a cell might be partitioned into several trapezoids, but every 
trapezoid belongs to a single cell, see Figure [10.3] Thus, once the trapezoidal map of S is 
constructed, we can easily “glue together" trapezoids that touch along their vertical sides, 
obtaining the original cells. Then we can answer point-location queries using the same 
routine that performs the Find step (whose implementation will be described below). 
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Figure 10.4: A new segment (dashed) is to be inserted 


10.5 Analysis of the incremental construction 


In order to analyze the runtime of the incremental construction, we insert the segments 
in random order, and we employ the configuration space framework. We also implement 
the Find step in such a way that the analysis boils down to conflict counting, just as for 
the Delaunay triangulation. 


10.5.1 Defining The Right Configurations 


Recall that a configuration space is a quadruple $ = (X, M, D, K), where X is the ground 
set, IT is the set of configurations, D is a mapping that assigns to each configuration its 
defining elements (“generators”), and K is a mapping that assigns to each configuration 
its conflict elements (“killers”). 

It seems natural to choose X = S, the set of segments, and to define IT as the set 
of all possible trapezoids that could appear in the trapezoidal map of some subset of 
segments. Indeed, this satisfies one important property of configuration spaces: for each 
configuration, the number of generators is constant. 


Lemma 10.2 For every trapezoid O in the trapezoidal map of R C S, there exists a 
set D CR of at most four segments, such that O is in the trapezoidal map of D. 


Proof. By our general position assumption, each non-vertical side of [] is a subset of a 
unique segment in R, and each vertical side of O is induced by a unique (left or right) 
endpoint, or by the intersection of two unique segments. In the latter case, one of these 
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Figure 10.5: The Trace and Split steps 


segments also contributes a non-vertical side, and in the former case, we attribute the 
endpoint to the “topmost” segment with that (left or right) endpoint. It follows that 
there is a set of at most four segments whose trapezoidal map already contains O. OU 


But there is a problem with this definition of configurations. Recall that we can apply 
the general configuration space analysis only if 


(i) the cost of updating J,_, to J, is proportional to the structural change, the 
number of configurations in J, \ J,—1; and 


(ii) the expected cost of all Find operations during the randomized incremental con- 
struction is proportional to the expected number of conflicts. (This is the “conflict 
counting” part.) 


Here we see that already (i) fails. During the Trace step, we traverse the boundary of 
each trapezoid intersected by s, in order to find the next trapezoid. Even if s, intersects 
only a small number of trapezoids (so that the structural change is small), the traversals 
may take very long. This is due to the fact that a trapezoid can be incident to a large 
number of edges. Consider the trapezoid labeled U in Figure It has many incident 
vertical extensions from above. Tracing a segment through such a trapezoid takes time 
that we cannot charge to the structural change. 

To deal with this, we slightly adapt our notion of configuration. 


Definition 10.3 Let TI be the set of all trapezoids together with at most one incident 
vertical edge (“trapezoids with tail”) that appear in the trapezoidal map of some 
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Figure 10.6: The Merge steps 


Figure 10.7: Trapezords may have arbitrarily large complexity 


subset of X = S, see Figure A trapezoid without any incident vertical edge is 
also considered a trapezord with tail. 


As it turns out, we still have constantly many generators. 


Lemma 10.4 For every trapezoid with tail O in the trapezoidal map of R C S, there 
exists a set D CR of at most sir segments, such that O 1s in the trapezoidal map 
of D. 


Proof. We already know from Lemma that the trapezoid without tail has at most 
4 generators. And since the tail is induced by a unique segment endpoint or by the 
intersection of a unique pair of segments, the claim follows. O 


Here is the complete specification of our configuration space $ = (X, M, D,K). 
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Figure 10.8: A trapezoid with tail is a trapezoid together with at most one vertical 
edge attached to its upper or its lower segment. 


Definition 10.5 Let X = S be the set of segments, and II the set of all trapezoids with 
tail. For each trapezoid with tail O, D(O) is the set of at most 6 generators. K(L) 
is the set of all segments that intersect O in the interior of the trapezoid, or cut off 
some part of the tail, or replace the topmost generator of the left or right side, see 


Figure [10.9] 


Then $ = (X, TT, D, K) is a configuration space of dimension at most 6, by Lemma|10.4| 
The only additional property that we need to check is that D(O) N K(O) = / for all 
trapezoids with tail, but this is clear since no generator of Ll properly intersects the 
trapezoid of O or cuts off part of its tail. 


Figure 10.9: A trapezoid with tail O is in conflict with a segment s (dashed) if s 
intersects O tn the interior of the trapezoid (left), or cuts off part of the 
tail (middle), or is a new topmost segment generating a vertical side. 


10.5.2 Update Cost 


Now we can argue that the update cost can be bounded by the structural change. We 
employ the same trick as for Delaunay triangulations. We prove that the update cost 
is in each step r — 1 — r proportional to the number of configurations that are being 
destroyed. Over the whole algorithm, we cannot destroy more configurations than we 
create, so the bound that we get is also a bound in terms of the overall structural change. 
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Lemma 10.6 Jn updating J,_1 to Ty, the steps Trace, Split, and Merge can be performed 
in time proportional to the number of trapezoids with tail in Tr- \ Te. 


Proof. By definition, the complexity (number of edges) of a trapezoid is proportional 
to the number of trapezoids with tail that share this trapezoid. This means, the cost of 
traversing the trapezoid can be charged to the trapezoids with tail containing it, and all 
of them will be destroyed (this includes the trapezoids with tail that just change their 
left or right generator; in the configuration space, this is also a destruction). This takes 
care of the Trace step. The Split and Merge steps can be done within the same asymptotic 
time bounds since they can be performed by traversing the boundaries of all intersected 
trapezoids a constant number of times each. For efficiently doing the manipulations on 
the trapezoidal map, we can for example represent it using a doubly-connected edge list. 
O 


We can now employ the general configuration space analysis to bound the expected 
structural change throughout the randomized incremental construction; as previously 
shown, this asymptotically bounds the expected cost of the Trace, Split, and Merge steps 
throughout the algorithm. Let us recall the general bound. 


Theorem 10.7 Let S = (X,TT,D,K) be a configuration space of fixed dimension with 
|X| = n. The expected number of configurations that are created throughout the 
algorithm is bounded by 


Tt 
O E 
(E). 
r=1 
where t, is the expected size of J,, the expected number of active configurations 
after r insertion steps. 


10.5.3 The History Graph 


Here is how we realize the Find step (as well as the point-location queries for our point- 
location application). It is a straightforward history graph approach as for Delaunay 
triangulations. Every trapezoid that we ever create is a node in the history graph; 
whenever a trapezoid is destroyed, we add outgoing edges to its (at most four) successor 
trapezoids. Note that trapezoids are destroyed during the steps Split and Merge. In the 
latter step, every destroyed trapezoid has only one successor trapezoid, namely the one 
it is merged into. It follows that we can prune the nodes of the “ephemeral” trapezoids 
that exist only between the Split and Merge steps. What we get is a history graph of 
degree at most 4, such that every non-leaf node corresponds to a trapezoid in J,_1 \ Jr, 
for some rT. 
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10.5.4 Cost of the Find step 


We can use the history graph for point location during the Find step. Given a segment 
endpoint p, we start from the bounding box (the unique trapezoid with no generators) 
that is certain to contain p. Since for every trapezoid in the history graph, its area is 
covered by the at most four successor trapezoids, we can simply traverse the history 
graph along directed edges until we reach a leaf that contains p. This leaf corresponds to 
the trapezoid of the current trapezoidal map containing p. By the outdegree-4-property, 
the cost of the traversal is proportional to the length of the path that we traverse. 

Here is the crucial observation that allows us to reduce the analysis of the Find step 
to “conflict counting”. Note that this is is precisely what we also did for Delaunay 
triangulations, except that there, we had to deal explicitly with “ephemeral” triangles. 

Recall Definition according to which a conflict is a pair (UU, s) where O is a 
trapezoid with tail, contained in some intermediate trapezoidal map, and s € K(L). 


Lemma 10.8 During a run of the incremental construction algorithm for the trape- 
zordal map, the total number of history graph nodes traversed during all Find steps 
is bounded by the number of conflicts during the run. 


Proof. Whenever we traverse a node (during insertion of segment s,, say), the node 
corresponds to a trapezoid [O (which we also consider as a trapezoid with tail) in some 
set J;,s < r, such that p € O, where p is the left endpoint of the segment s,. We can 
therefore uniquely identify this edge with the conflict (O, s+). The statement follows. O 


Now we can use the configuration space analysis that precisely bounds the expected 
number of conflicts, and therefore the expected cost of the Find steps over the whole 
algorithm. Let us recapitulate the bound. 


Theorem 10.9 Let S = (X, TI, D,K) be a configuration space of fixed dimension d with 
IX] =n. The expected number of conflicts during randomized incremental construc- 
tion of Jn 1s bounded by 


Tt 
$ 
r=1 
where t, 1s as before the expected size of J,. 


10.5.5 Applying the General Bounds 


Let us now apply Theorem[10.7|and Theorem|10.9|to our concrete situation of trapezoidal 
maps. What we obviously need to determine for that is the quantity t,, the expected 
number of active configurations after r insertion steps. 

Recall that the configurations are the trapezoids with tail that exist at this point. 
The first step is easy. 
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Observation 10.10 In every trapezoidal map, the number of trapezoids with tail is 
proportional to the number vertices. 


Proof. Every trapezoid with tail that actually has a tail can be charged to the vertex of 
the trapezoidal map on the “trapezoid side” of the tail. No vertex can be charged twice 
in this way. The trapezoids with no tail are exactly the faces of the trapezoidal map, 
and since the trapezoidal map is a planar graph, their number is also proportional to the 
number vertices. O 


Using this observation, we have therefore reduced the problem of computing t, to the 
problem of computing the expected number of vertices in J,. To count the latter, we 
note that every segment endpoint and every segment intersection generates 3 vertices: 
one at the point itself, and two where the vertical extensions hit another feature. Here, 
we are sweeping the 4 bounding box vertices under the rug. 


Observation 10.11 In every trapezordal map of r segments, the number of vertices 1s 
6r + 3k, 


where k is the number of pairwise intersections between the r segments. 


So far, we have not used the fact that we have a random insertion order, but this 
comes next. 


Lemma 10.12 Let K be the total number of pairwise intersections between segments 
in S, and let k, be the random variable for the expected number of pairwise in- 
tersecttons between the first r segments inserted during randomized incremental 
construction. Then 


Proof. Let us consider the intersection point of two fixed segments s and s’. This 
intersection point appears in J, if and only both s and s’ are among the first r segments. 
There are des ways of choosing the set of r segments (and all choices have the same prob- 
ability); since the number of r-element sets containing s and s’ is ee the probability 
for the intersection point to appear is 


(a). Fr 1) 


(©) n(n—1) 


r 


Summing this up over all K intersection points, and using linearity of expectation, the 
statement follows. O 


From Observation|10.10| Observation|10.11land Lemma|10.12| we obtain the following 
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Corollary 10.13 The expected number t, of active configurations after r insertion steps 


1s 
t=0 (E2). 
n 


Plugging this into Theorem and Theorem|10.9| we obtain the following final 


Theorem 10.14 Let S be a set of n segments in the plane, with a total of K pairwise 
intersections. The randomized incremental construction computes the trapezoidal 
map of S in time 


O(nlogn + K). 


Proof. We already know that the expected update cost (subsuming steps Trace, Split, and 
Merge) is proportional to the expected overall structural change, which by Theorem 
is 


O (Es) =0(n)+0 (SÈ) =0(n+K). 


We further know that the expected point location cost (subsuming step Find) is pro- 
portional to the overall expected number of conflicts which by Theorem is 


ofn): "|= otntogn) +0 (* yi)- O(nlogn + K). 


10.6 Analysis of the point location 


Finally, we return to the application of trapezoidal maps for point location. We make 
precise what we mean by saying that “point-location queries are handled in O(logn) 
expected time", and we prove our claim. 


Lemma 10.15 Let S = [s¡,...,sn) be any set of n segments. Then there exists a 
constant c > 0 such that, with high probability (meaning, with probability tending 
to 1 as n —> co), the history graph produced by the random incremental construction 
answers every possible poznt-location query in time at most clogn. 


Note that our only randomness assumption is over the random permutation of S 
chosen at the beginning of the incremental construction. We do not make any randomness 
assumption on the given set of segments. 

The proof of Lemma is by a typical application of Chernoff’s bound followed 
by the union bound. 

Recall (or please meet) Chernoff’s bound: 
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Lemma 10.16 Let Xi, X2,...,Xn be independent 0/1 random variables, and let X = 
X1+-::+Xpn. Let pi = PrlX; = 1], and let u=ElX] =p,+:::+Ppn. Then, 


-5 u 
Pr[X < (1 — ô)u] < “sa for every0<ó5<l; 
e u 
Pr[X > (1+ ô8)u] < (sa) for every 5 > 0. 


The important thing to note is that e~°/(1—5)+~° as well as e? /(1+58)**° are strictly 
less than 1 for every fixed ô > 0, and decrease with increasing ô. 

Now back to the proof of Lemma [10.15| 
Proof. First note that, even though there are infinitely many possible query points, there 
is only a finite number of combinatorially distinct possible queries: If two query points 
lie together in every trapezoid (either both inside or both outside), among all possible 
trapezoids defined by segments of S, then there is no difference in querying one point 
versus querying the other, as far as the algorithm is concerned. Since there are O(n%) 
possible trapezoids (recall that each trapezoid is defined by at most four segments), there 
are only O(n*) queries we have to consider. 

Fix a query point q. We will show that there exists a large enough constant c > 0, 
such that only with probability at most O(n~°) does the query on q take more than 
clogn steps. 

Let s1, S2,..., Sn be the random order of the segments chosen by the algorithm, and 
for 1 < r < n let J, be the trapezoidal map generated by the first r segments. Note 
that for every r, the point q belongs to exactly one trapezoid of J}. The question is how 
many times the trapezoid containing q changes during the insertion of the segments, 
since these are exactly the trapezoids of the history graph that will be visited when we 
do a point-location query on q. 

For 1 <r < n, let A, be the event that the trapezoid containing q changes from 
T,-1 to T,. What is the probability of A,? As in Section we “run the movie 
backwards": To obtain J,_; from J,, we delete a random segment from among s),..., Sr} 
the probability that the trapezoid containing q in J, is destroyed is at most 4/r, since 
this trapezoid is defined by at most four segments. Thus, Pr[A,] < 4/r, independently 
of every other A,, s AT. 

For each r let X, be a random variable equal to 1 if A, occurs, and equal to O otherwise. 
We are interested in the quantity X = X, +---+ X,. Then u = E[X] = sear PrlA,] = 
4lnn + O(1). Applying Chernoff’s bound with ô = 2 (it is just a matter of choosing ô 
large enough), we get 


Pr[X > (1 + 5) u] < 0.2734 = 0(0.2734=") — O(n7®!9), 


so we can take our c to be anything larger than 4(1 + ô) = 12. 
Thus, for every fixed query q, the probability of a “bad event" (a permutation that 
results in a long query time) is O(n~°). Since there are only O(n*) possible choices for 
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q, by the union bound the probability of some q having a bad event is O(1/n), which 
tends to zero with n. O 


10.7 The trapezoidal map of a simple polygon 


An important special case of the trapezoidal map is obtained when the input segments 
form a simple polygon; seee Figure In this case, we are mostly interested in the 
part of the trapezoidal map inside the polygon, since that parts allows us to obtain a 
triangulation of the polygon in linear time. 


Figure 10.10: The trapezoidal map inside a simple polygon 


To get a triangulation, we first go through all trapezoids; we know that each trapezoid 
must have one polygon vertex on its left and one on its right boundary (due to general 
position, there is actually exactly one vertex on each of these two boundaries). Whenever 
the segment connecting these two vertices is not an edge of the polygon, we have a 
diagonal, and we insert this diagonal. Once we have done this for all trapezoids, it is 
easily seen that we have obtained a subdivision of the polygon into x-monotone polygons, 
each of which can be triangualated in linear time; see Exercise This immediately 
allows us to improve over the statement of Exercise [2.16] 


Corollary 10.17 A simple polygon with n vertices can be triangulated in expected time 
O(nlogn). 


Proof. By Theorem the trapezoidal decomposition induced by the segments 
of a simple polygon can be combuted in expected time O(nlogn), since there are no 
intersections between the segments. Using the above linear-time triangulation algorithm 
from the trapezoidal map, the result follows. O 
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The goal of this section is to further improve this bound and show the following 
result. 


Theorem 10.18 Let S = {s1, S2,...,Sn} be the set of edges of an n-verter simple poly- 
gon, in counterclockwise order around the polygon. The trapezoidal map induced by 
S (and thus also a triangulation of S) can be computed in exptected time O(n log” n). 


Informally speaking, the function log* n is the number of times we have to iterate the 
operation of taking (binary) logarithms, before we get from n down to 1. Formally, we 
define 


log’™’(n) -{ log "3 (log n), otherwise 


as the h-times iterated logarithm, and for n > 1 we set 
log* n = max{h: log") n > 1). 

For example, we have 
log* (295536) = 5, 


meaning that for all practical purposes, log*n < 5; a bound of O(nlog* n) is therefore 
very close to a linear bound. 


History flattening. Recall that the bottleneck in the randomized incremental construction 
is the Find step. Using the special structure we have (the segments form a simple polygon 
in this order), we can speed up this step. Suppose that at some point during incremental 
construction, we have built the trapezoidal map of a subset of r segments, along with 
the history graph. We now flatten the history by removing all trapezoids that are not in 
J,, the current trapezoidal map. To allow for point location also in the future, we need 
an “entry point” into the flattened history, for every segment not inserted so far (the old 
entry point for all segments was the bounding unit square (0, 1]). 


Lemma 10.19 Let S be the set of edges of an n-verter simple polygon, in counterclock- 
wise order around the polygon. For RCS, let T(R) be the trapezoidal map induced 
by R. In time proportional to n plus the number of conflicts between trapezoids in 
T(R) and segments in SY R, we can find for all segment s € S a trapezoid of T(R) 
that contains an endpoint of s. 


Proof. In a trivial manner (and in time O(n)), we do this for the first segment s, and 
its first endpoint pı. Knowing the trapezoid Lj; containing pi, the first endpoint of si, 
we trace the segment si through J(R) until p;,,, its other endpoint and first endpoint 
of si+ı is found, along with the trapezoid i+; containing it. This is precisely what we 
also did in the Trace Step 2 of the incremental construction in Section [10.3] 
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By Lemma [10.6] (pretending that we are about to insert si), the cost of tracing si 
through J(R) is proportional to the size of J(R) \ T(R U(s;¡)), equivalently, the number 
of conflicts between trapezoids in J(R) and s;. The statement follows by adding up the 
costs for all i. O 


This is exactly where the special structure of our segments forming a polygon helps. 
After locating pi, we can locate the next endpoint pi+ı in time proportional to the 
structural change that the insertion of s; would cause. We completely avoid the traversal 
of old history trapezoids that would be necessary for an efficient location of pj,, from 
scratch. 

Next we show what Lemma [10.19] gives us in expectation. 


Lemma 10.20 Let S be the set of edges of an n-verter simple polygon, in counter- 
clockwise order around the polygon, and let J, be the trapezoidal map obtained 
after inserting r segments in random order. In expected time O(n), we can find for 
each segment s not inserted so far a trapezoid of J, containing an endpoint of s. 


Proof. According to Lemma|10.19| the expected time is bounded by O(n + ¢(r)), where 


U(r) = al > Y te sR) :y e KO}. 


RCS,|R|=r yeS\R 


This looks very similar to the bound on the quantity k(r) that we have derived in 
Section [8.5] to count the expected number of conflicts in general configuration spaces: 


d 
k(r) < a) Y EY ese): ye Kay. 


RCS,|Rl=r -yeS\R 
Indeed, the difference is only an additional factor of a in the latter. For k(r), we have 
then computed the bound 


2 


r(r + 1) (n—1)tr+1, (10.21) 


klt) Ka ( ka +kalt) < Enri mty + 


where t, is the expected size of J.. Hence, to get a bound for l(r), we simply have to 
cancel a from all terms to obtain 


A E O A ones Som, 


r+1 


since t, < t,+1 = O(r) in the case of nonintersecting line segments. O 
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The faster algorithm. We proceed as in the regular randomized incremental construction, 
except that we frequently and at well-chosen points flatten the history. Let us define 


Nin) =| 


We have N(0) = 1, N(log*n) < n and N(log*n +1) >n. We insert the segments in 
random order, but proceed in log* n + 1 rounds. In round h = 1,...,log*n+1, we do 
the following. 


n 
——— |, O0<h<logřn. 
log”) -l : 


(i) Flatten the history graph by finding for each segment s not inserted so far a trape- 
zoid of the current trapezoidal map containing an endpoint of s. 


(ii) Insert the segments N(h — 1) up to N(h) — 1 in the random order, as usual, but 
starting from the flat history established in (i). 


In the last round, we have N(h) — 1 > n, so we stop with segment n in the random 
order. 

From Lemma we know that the expected cost of step (i) over all rounds is 
bounded by O(nlog* n) which is our desired overall bound. It remains to prove that 
the same bound also deals with step (ii). We do not have to worry about the overall 
expected cost of performing the structural changes in the trapezoidal map: this will be 
bounded by O(n), using t, = O(r) and Theorem It remains to analyze the Find 
step, and this is where the history flattening leads to a speedup. Adapting Lemma [10.8] 
and its proof accordingly, we obtain the following. 


Lemma 10.22 During round h of the fast incremental construction algorithm for the 
trapezoidal map, the total number of history graph nodes traversed during all Find 
steps 1s bounded by N(h)—N(h—1F} plus the number of conflicts between trapezoids 
that are created during round h, and segments inserted during round h. 


Proof. The history in round h only contains trapezoids that are active at some point in 
round h. On the one hand, we have the “root nodes” present immediately after flattening 
the history, on the other hand the trapezoids that are created during the round. The 
term N(h)—N(h—1) accounts for the traversals of the root nodes during round h. As in 
the proof of Lemma the traversals of other history graph nodes can be charged to 
the number of conflicts between trapezoids that are created during round h and segments 
inserted during round h. O 

To count the expected number of conflicts involving trapezoids created in round h, we 
go back to the general configuration space framework once more. With k;, (r) being the 
expected number of conflicts created in step r, restricted to the ones involving segments 
inserted in round h, we need to bound 

N(h)-1 


k(h) = kn(r). 
r=N(h-1) 


lthis amounts to O(n) throughout the algorithm and can therefore be neglected 
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As in (10.21), we get 


2 
kn(t) < Nie — S(N(h) — a + N a 
r Tr r(r+1) 


where we have replaced n with N(h), due to the fact that we do not need to consider 
segments in later rounds. Then t, = O(r) yields 


Kh) < OfN Y= 
MA 

= o (Ninos qq) 
1 (h—1) 

6 (Niis E) 


Il 


O (Nin) log"? n) = O(n). 


It follows that step (ii) of the fast algorithm also requires expected linear time per 
round, and Theorem }10.18) follows. 


Exercise 10.23 a) You are given a set of n pairwise disjoint line segments. Find 
an algorithm to answer vertical ray shooting queries in O(logn) time. That is, 
preprocess the data such that given a query point q you can report in O(log n) 
time which segment is the first above q (or if there are none). Analyze the 
running tume and the space consumption of the preprocessing. 


b) What happens if we allow intersections of the line segments? Ezplain in a 
few words how you have to adapt your solution and how the time and space 
complexity would change. 


Exercise 10.24 Show that an n-verter x-monotone polygon can be triangulated in time 
O(n). (As usual a polygon is given as a list of its vertices in counterclockwise order. 
A polygon P is called x-monotone if for all vertical lines £, the intersection (MP 
has at most one component.) 


Questions 


41. What is the definition of a trapezoidal map? 


42. How does the random incremental construction of a trapezoidal map proceed? 
What are the main steps to be executed at each iteration? 


43. How can trapezoidal maps be used for the point location problem? 
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44. 


45. 


46. 


47. 


48. 


49. 


50. 


What is the configuration space framework? Recall Section [8.3] 


What is a naive way of defining a configuration in the case of trapezoids, and 
why does it fail? 


What 1s a more successful way of defining a configuration? Why do things 
work in this case? 


What is the history graph, and how 1s it used to answer point location queries? 


What is the performance of the random incremental construction of the trape- 
zoidal map when used for the point-location problem? Be precise! 


What probabilistic techniques are used in proving this performance bound? 


How can you speed up the randomized incremental construction in the case 
where the input segments from a simple polygon? Sketch the necessary changes 
to the algorithm, and how they affect the analysis. 
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Line Arrangements 


During the course of this lecture we encountered several situations where it was conve- 
nient to assume that a point set is “in general position”. In the plane, general position 
usually amounts to no three points being collinear and/or no four of them being cocircu- 
lar. This raises an algorithmic question: How can we test for n given points whether or 
not three of them are collinear? Obviously, we can test all triples in O(n?) time. Can we 
do better? Yes, we can! Using a detour through the so-called dual plane, we will see that 
this problem can be solved in O(n?) time. However, the exact algorithmic complexity 
of this innocent-looking problem is not known. In fact, to determine this complexity is 
one of the major open problems in theoretical computer science. 

We will get back to the complexity theoretic problems and ramifications at the end 
of this chapter. But first let us discuss how to obtain a quadratic time algorithm to test 
whether n given points in the plane are in general position. This algorithm is a nice ap- 
plication of the projective duality transform, as defined below. Such transformations are 
very useful because they allow us to gain a new perspective on a problem by formulating 
it in a different but equivalent form. Sometimes such a dual form of the problem is easier 
to work with and—given that it is equivalent to the original primal form—any solution 
to the dual problem can be translated back into a solution to the primal problem. 

So what is this duality transform about? Observe that points and hyperplanes in R4 
are very similar objects, given that both can be described using d coordinates/parameters. 
It is thus tempting to match these parameters to each other and so create a mapping 
between points and hyperplanes. In R? hyperplanes are lines and the standard projec- 
tive duality transform maps a point p = (px, py) to the line p* : y= pxx—py and a 
non-vertical line g : y= mx + b to the point g* = (m, —b). 


Proposition 11.1 The standard projective duality transform is 
e incidence preserving: p E g => g* €p* and 
e order preserving: p 1s above g <> g* ts above p*. 


Exercise 11.2 Prove Proposition 
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Exercise 11.3 Describe the image of the following point sets under this mapping 
a) a halfplane 
b) k >3 collinear points 
c) a line segment 
d) the boundary points of the upper convex hull of a finite point set. 


Another way to think of duality is in terms of the parabola Y : y = pe. For a point 
p on ?, the dual line p* is the tangent to P at p. For a point p not on P, consider the 
vertical projection p’ of p onto P: the slopes of p* and p’* are the same, just p* is shifted 
by the difference in y-coordinates. 


Figure 11.1: Point + line duality with respect to the parabola P: y = me. 


The question of whether or not three points in the primal plane are collinear trans- 
forms to whether or not three lines in the dual plane meet in a point. This question in 
turn we will answer with the help of line arrangements, as defined below. 


11.1 Arrangements 


The subdivision of the plane induced by a finite set L of lines is called the arrangement 
A(L). Observe that all cells of the subdivision are intersections of halfplanes and thus 
convex. A line arrangement is simple if no two lines are parallel and no three lines meet 
in a point. Although lines are unbounded, we can regard a line arrangement a bounded 
object by (conceptually) putting a sufficiently large box around that contains all vertices. 
Such a box can be constructed in O(n log n) time for n lines. 


Exercise 11.4 How? 
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Moreover, we can view a line arrangement as a planar graph by adding an additional 
vertex at “infinity”, that is incident to all rays which leave this bounding box. For 
algorithmic purposes, we will mostly think of an arrangement as being represented by a 
doubly connected edge list (DCEL), cf. Section [4.2] 


n 


z) vertices, n? edges, 


Theorem 11.5 A simple arrangement A(L) ofn lines in R? has ( 
and (3) +n+1 faces/cells. 


Proof. Since all lines intersect and all intersection points are pairwise distinct, there are 
(2) vertices. 

The number of edges we count using induction on n. For n = 1 we have 1? = 1 edge. 
By adding one line to an arrangement of n — 1 lines we split n — 1 existing edges into 
two and introduce n new edges along the newly inserted line. 'Thus, there are in total 
(n— 1) +2n—1 =n*-2n+1+2n-— 1 = n? edges. 

The number f of faces can now be obtained from Euler's formula v—e+f = 2, where 
v and e denote the number of vertices and edges, respectively. However, in order to 
apply Euler's formula we need to consider A(L) as a planar graph and take the symbolic 
“infinite” vertex into account. Therefore, 


f=2- (G) +1) $n? =14 (00 n(n—1)) =1+3(n2+n) =i (5) +n. 


The complexity of an arrangement is simply the total number of vertices, edges, and faces 
(in general, cells of any dimension). 


Exercise 11.6 Consider a set of lines in the plane with no three intersecting in a 
common point. Form a graph G whose vertices are the intersection points of the 
lines and such that two vertices are adjacent if and only if they appear consecutively 
along one of the lines. Prove that x(G) < 3, where x(G) denotes the chromatic 
number of the graph G. In other words, show how to color the vertices of G using 
at most three colors such that no two adjacent vertices have the same color. 


11.2 Construction 


As the complexity of a line arrangement is quadratic, there is no need to look for a sub- 
quadratic algorithm to construct it. We will simply construct it incrementally, inserting 
the lines one by one. Let f,,...,f, be the order of insertion. 

At Step i of the construction, locate 0; in the leftmost cell of A((%,,...,l;¡_1)) it 
intersects. (The halfedges leaving the infinite vertex are ordered by slope.) This takes 
O(i) time. Then traverse the boundary of the face F found until the halfedge h is found 
where l; leaves F (see Figure for illustration). Insert a new vertex at this point, 
splitting F and h and continue in the same way with the face on the other side of h. 
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Figure 11.2: Incremental construction: Insertion of a line l. (Only part of the ar- 
rangement 1s shown in order to increase readability. ) 


The insertion of a new vertex involves splitting two halfedges and thus is a constant 
time operation. But what is the time needed for the traversal? The complexity of 
A({li,..., G1} is O(12), but we will see that the region traversed by a single line has 
linear complexity only. 


11.3 Zone Theorem 


For a line £ and an arrangement A(L), the zone Z4(1)(€) of £ in A(L) is the set of cells 
from A(L) whose closure intersects 0. 


Theorem 11.7 Given an arrangement A(L) of n lines in R? and a line (not neces- 
sarily from L), the total number of edges in all cells of the zone Z4(1,(l) 1s at most 
6n. 


Proof. Without loss of generality suppose that £ is horizontal and that none of the lines 
from L is horizontal. (The first condition can be addressed by rotating the plane and 
the second by deciding that the left vertex of a horizontal edge is higher than the right 
vertex.) 

For each cell of Z4(1)(0) split its boundary at its topmost 
vertex and at its bottommost vertex and orient all edges from 
bottom to top. Those edges that have the cell to their right are 
called left-bounding for the cell and those edges that have the 
cell to their left are called right-bounding. For instance, for the 
cell depicted to the right all left-bounding edges are shown blue 
and bold. 

We will show that there are at most 3n left-bounding edges in Z4(,)(¢) by induction 
on n. By symmetry, the same bound holds also for the number of right-bounding edges 
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in Z A(L) (£) . 

For n = 1, there is at most one (exactly one, unless £ is parallel to and lies below the 
only line in L) left-bounding edge in Z4(1)(() and 1 < 3n =3. Assume the statement is 
true for n— 1. 


Figure 11.3: At most three new left-bounding edges are created by adding r to A(L\{r}). 


If no line from L intersects £, then all lines in L U {£} are parallel and there is at 
most 1 < 3n left-bounding edge in Za(1)(£). Else consider the rightmost line r from L 
intersecting l and the arrangement A(L \{r}). By the induction hypothesis there are at 
most 3n — 3 left-bounding edges in Z4(1\1+})(¢). Adding r back adds at most three new 
left-bounding edges: At most two edges (call them ĉo and ¢,) of the rightmost cell of 
Zat} (£) are intersected by r and thereby split in two. Both of these two edges may be 
left-bounding and thereby increase the number of left-bounding edges by at most two. 
In any case, r itself contributes exactly one more left-bounding edge to that cell. The 
line r cannot contribute a left-bounding edge to any cell other than the rightmost: to 
the left of r, the edges induced by r form right-bounding edges only and to the right 
of r all other cells touched by r (if any) are shielded away from £ by one of ly or 0}. 
Therefore, the total number of left-bounding edges in Z4(1)(£) is bounded from above 
by 3+ 3n-— 3 = 3n. O 


Corollary 11.8 The arrangement of n lines in R? can be constructed in optimal O(n?) 
tume and space. 


Proof. Use the incremental construction described above. In Step i, for 1 <i<n, 
we do a linear search among i — 1 elements to find the starting face and then traverse 
(part of) the zone of the line ¢; in the arrangement A((l;,...,l;-1)). By Theorem [11.7] 
the complexity of this zone and hence the time complexity of Step i altogether is O(i). 
Overall we obtain ) ||, ci = O(n?) time (and space), for some constant c > 0, which is 
optimal by Theorem [11.5] O 


The corresponding bounds for hyperplane arrangements in Rt are O(n*) for the com- 
plexity of a simple arrangement and O(n“1) for the complexity of a zone of a hyperplane. 
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Exercise 11.9 For an arrangement A of a set of n lines in R?, let F := Ue is cen ga 
denote the union of the closure of all bounded cells. Show that the complexity 
(number of vertices and edges of the arrangement lying on the boundary) of F is 


O(n). 


11.4 The Power of Duality 


The real beauty and power of line arrangements becomes apparent in context of projective 
point © line duality. The following problems all can be solved in O(n?) time and space 
by constructing the dual arrangement. 


General position test. Given n points in R?, are any three of them collinear? (Dual: do 
three lines meet in a point?) 


Minimum area triangle. Given n points in R?, what is the minimum area triangle spanned 
by any three of them? For any vertex (* of the dual arrangement (primal: line £ through 
two points p and q) find the closest point vertically above/below (* through which an 
input line passes (primal: closest line below/above and parallel to £ that passes through 
an input point). In this way one can find O(n?) candidate triangles by constructing the 
arrangement of the n dual lines!] The smallest among those candidates can be determined 
by a straightforward minimum selection (comparing the area of the corresponding trian- 
gles). Observe that vertical distance is not what determines the area of the corresponding 
triangle but orthogonal distance. However, the points that minimize these measures for 
any fixed line are the same... 


Exercise 11.10 A set P of n points in the plane is said to be in e-general position for 
e >0 2f no three points of the form 


pP F (x1, Yi), q ag (X2, Ya), Y F (x3, Ya) 


are collinear, where p, q,r € P and |x|, lyi] < e, for i € {1,2,3}. In words: P remains 
in general position under changing point coordinates by less than e each. 

Give an algorithm with runtime O(n?) for checking whether a given point set P 
is in e-general position. 


11.5 Sorting all Angular Sequences. 


Theorem 11.11 Consider a set P of n points in the plane. For a point q € P let cp(q) 
denote the circular sequence of points from S\ {q} ordered counterclockwise around 


¡For instance, maintain over the incremental construction for each vertex a vertically closest line. The 
number of vertices to be updated during insertion of a line £ corresponds to the complexity of the zone of 
l in the arrangement constructed so far. Therefore maintaining this information comes at no extra cost 
asymptotically. 
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q (in order as they would be encountered by a ray sweeping around q). All cp(q), 
q € P, collectively can be obtained in O(n?) time. 


Proof. Assume without loss of generality that no two points in P have the same x- 
coordinate (else rotate the plane infinitesimally). Consider the projective dual P* of P. 
An angular sweep around a point q € P in the primal plane corresponds to a traversal 
of the line q* from left to right in the dual plane. (A collection of lines through a single 
point q corresponds to a collection of points on a single line q* and slope corresponds 
to x-coordinate.) Clearly, the sequence of intersection points along all lines in P* can 
be obtained by constructing the arrangement in O(n?) time. In the primal plane, any 
such sequence corresponds to an order of the remaining points according to the slope of 
the connecting line; to construct the circular sequence of points as they are encountered 
around q, we have to split the sequence obtained from the dual into those points that 
are to the left of q and those that are to the right of q; concatenating both yields the 
desired sequence. O 


Exercise 11.12 (Eppstein [? ]) Describe an O(n?) time algorithm that given a set P of 
n points in the plane finds a subset of five points that form a strictly conver empty 
pentagon (or reports that there is none if that is the case). Empty means that the 
conver pentagon may not contain any other points of P. 

Hint: Start with a point p € P that ts extremal in one direction and try to find 
out whether there is a solution P’ containing p. For this, consider the star-shaped 
polygon that visits all points in radial order, as seen from p. 

Remark: It was shown by Harborth [? ] that every set of ten or more points in 
general position contains a subset of five points that form a strictly conver empty 
pentagon. 


11.6 Segment Endpoint Visibility Graphs 


A fundamental problem in motion planning is to find a short(est) path between two 
given positions in some domain, subject to certain constraints. As an example, suppose 
we are given two points p,q € R? and a set S C R? of obstacles. What is the shortest 
path between p and q that avoids S? 


Observation 11.13 The shortest path (if 1t exists) between two points that does not 
cross a finite set of finite polygonal obstacles is a polygonal path whose interior 
vertices are obstacle vertices. 


One of the simplest type of obstacle conceivable is a line segment. In general the 
plane may be disconnected with respect to the obstacles, for instance, if they form a 
closed curve. However, if we restrict the obstacles to pairwise disjoint line segments then 
there is always a free path between any two given points. Apart from start and goal 
position, by the above observation we may restrict our attention concerning shortest 
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paths to straight line edges connecting obstacle vertices, in this case, segment endpoints. 


Definition 11.14 Consider a set S of n disjoint line segments in R?. The segment 
endpoint visibility graph V(S) is a geometric straight line graph defined on the segment 
endpoints. Two segment endpoints p and q are connected in V(S) 2f and only if 


e the line segment pg is in S or 


e pqnsC {p,q} for every segment s € S. 


A 
\ 


Figure 11.4: A set of disjoint lane segments and their endpoint visibility graph. 


If all segments are on the convex hull, the visibility graph is complete. If they form 
parallel chords of a convex polygon, the visibility graph consists of copies of K4, glued 
together along opposite edges and the total number of edges is linear only. 

These graphs also appear in the context of the following question: Given a set of 
disjoint line segments, is it possible to connect them to form (the boundary of) a simple 
polygon? Is it easy to see that this is not possible in general: Just take three parallel 
chords of a convex polygon (Figure[11.5a). However, if we do not insist that the segments 
appear on the boundary, but allow them to be diagonals or epigonals, then it is always 
possible [4]. In other words, the segment endpoint visibility graph of disjoint line 
segments is Hamiltonian, unless all segments are collinear. It is actually essential to 


allow epigonals and not only diagonals [9] [3] (Figure [11.5b). 


| | \ AN 


Figure 11.5: Sets of disjoint line segments that do not allow certain polygons. 


Constructing V(S) for a given set S of disjoint segments in a brute force way takes 
O(n?) time. (Take all pairs of endpoints and check all other segments for obstruction.) 
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Theorem 11.15 (Welzl [10]) The segment endpoint visibility graph of n disjoint line 
segments can be constructed in worst case optimal O(n?) time. 


Proof. For simplicity we assume general position, that is, no three endpoints are collinear 
and no two have the same x-coordinate. It is no problem to handle such degeneracies 
explicitly. 

We have seen above how all sorted angular sequences can be obtained from the dual 
line arrangement in O(n?) time. Topologically sweep the arrangement from left to right 
(corresponds to changing the slope of the primal rays from —oo to +00) while maintaining 
for each segment endpoint p the segment s(p) it currently “sees” (if any). Initialize by 
brute force in O(n?) time (direction vertically downwards). Each intersection of two lines 
corresponds to two segment endpoints “seeing” each other along the primal line whose 
dual is the point of intersection. In order to process an intersection, we only need that 
all preceding (located to the left) intersections of the two lines involved have already 
been processed. This order corresponds to a topological sort of the arrangement graph 
where all edges are directed from left to right. (Clearly, this graph is acyclic, that is, it 
does not contain a directed cycle.) A topological sort can be obtained, for instance, via 
(reversed) post order DF'S in time linear in the size of the graph (number of vertices and 
edges), which in our case here is O(n”). 

When processing an intersection, there are four cases. Let p and q be the two points 
involved such that p is to the left of q. 


1. The two points belong to the same input segment — output the edge pq, no change 
otherwise. 


2. q is obscured from p by s(p) > no change. 
3. q is endpoint of s(p) > output pq and update s(p) to s(q). 


4. Otherwise q is endpoint of a segment t that now obscures s(p) —> output pq and 
update s(p) to t. 


Thus any intersection can be processed in constant time and the overall runtime of this 
algorithm is quadratic. O 


11.7 Ham Sandwich Theorem 


Suppose two thieves have stolen a necklace that contains rubies and diamonds. Now it 
is time to distribute the prey. Both, of course, should get the same number of rubies 
and the same number of diamonds. On the other hand, it would be a pity to completely 
disintegrate the beautiful necklace. Hence they want to use as few cuts as possible to 
achieve a fair gem distribution. 

To phrase the problem in a geometric (and somewhat more general) setting: Given 
two finite sets R and D of points, construct a line that bisects both sets, that is, in either 
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halfplane defined by the line there are about half of the points from R and about half of 
the points from D. 'To solve this problem, we will make use of the concept of levels in 
arrangements. 


Definition 11.16 Consider an arrangement A(L) induced by a set L of n non-vertical 
lines in the plane. We say that a point p is on the k-level in A(L) if there are at 
most k—1 lines below and at most n—k lines above p. The 1-level and the n-level 
are also referred to as lower and upper envelope, respectzvely. 


Figure 11.6: The 3-level of an arrangement. 


Another way to look at the k-level is to consider the lines to be real functions; then the 
lower envelope is the pointwise minimum of those functions, and the k-level is defined 
by taking pointwise the k**-smallest function value. 


Theorem 11.17 Let R,D C R? be finite sets of points. Then there exists a line that 
bisects both R and D. That is, in either open halfplane defined by l there are no 
more than |R|/2 points from R and no more than |D|/2 points from D. 


Proof. Without loss of generality suppose that both |R| and |D| are odd. (If, say, |R| is 
even, simply remove an arbitrary point from R. Any bisector for the resulting set is also 
a bisector for R.) We may also suppose that no two points from RUD have the same 
x-coordinate. (Otherwise, rotate the plane infinitesimally. ) 

Let R* and D* denote the set of lines dual to the points from R and D, respectively. 
Consider the arrangement A(R*). The median level of A(R*) defines the bisecting lines 
for R. As |R| = |R*| is odd, both the leftmost and the rightmost segment of this level 
are defined by the same line £, from R*, the one with median slope. Similarly there is a 
corresponding line la in A(D*). 

Since no two points from RUD have the same x-coordinate, no two lines from R* UD* 
have the same slope, and thus £ĉ, and fg intersect. Consequently, being piecewise linear 
continuous functions, the median level of A(R*) and the median level of A(D*) intersect 
(see Figure [11.7] for an example). Any point that lies on both median levels corresponds 
to a primal line that bisects both point sets simultaneously. O 
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Figure 11.7: An arrangement of 3 green lines (solid) and 3 blue lines (dashed) and 
their median levels (marked bold on the right hand side). 


How can the thieves use Theorem If they are smart, they drape the necklace 
along some convex curve, say, a circle. Then by Theorem [11.17] there exists a line that 
simultaneously bisects the set of diamonds and the set of rubies. As any line intersects 
the circle at most twice, the necklace is cut at most twice. It is easy to turn the proof 
given above into an O(n?) algorithm to construct a line that simultaneously bisects both 
sets. 

You can also think of the two point sets as a discrete distribution of a ham sandwich 
that is to be cut fairly, that is, in such a way that both parts have the same amount of 
ham and the same amount of bread. That is where the name “ham sandwich cut” comes 
from. The theorem generalizes both to higher dimension and to more general types of 
measures (here we study the discrete setting only where we simply count points). These 
generalizations can be proven using the Borsuk-Ulam Theorem, which states that any 
continuous map from SY to R must map some pair of antipodal points to the same 
point. For a proof of both theorems see, for instance, Matougek's book [8]. 


Theorem 11.18 Let P,,...,P4 CAR be finite sets of points. Then there exists a hy- 
perplane H that simultaneously bisects all of P,,...,Pa. That is, in either open 
halfspace defined by H there are no more than |P;|/2 points from Pi, for every 
i € {1,..., d}. 


This implies that the thieves can fairly distribute a necklace consisting of d types of 
gems using at most d cuts. 

In the plane, a ham sandwich cut can be found in linear time using a sophisticated 
prune and search algorithm by Lo, Matoušek and Steiger [7]. But in higher dimension, 
the algorithmic problem gets harder. In fact, already for R? the complexity of finding a 
ham sandwich cut is wide open: The best algorithm known, from the same paper by Lo 
et al. [7], has runtime O(n°/? log? n/log* n) and no non-trivial lower bound is known. 
If the dimension d is not fixed, it is both NP-hard and W{1]-hard?] in d to decide the 


“Essentially this means that it is unlikely to be solvable in time O(f(d)p(n)), for an arbitrary function 
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following question [6]: Given d € N, finite point sets P,,...,Pa C R4, and a point 
pe ES Pi, is there a ham sandwich cut through p? 


Exercise 11.19 The goal of this exercise 15 to develop a data structure for halfspace 
range counting. 


a) Given a set P C R? of n points in general position, show that it is possible to 
partition this set by two lines such that each region contains at most [7] points. 


b) Design a data structure of size O(n), which can be constructed in time O(n log n) 
and allows you, for any halfspace h, to output the number of points |PMh| of P 
contained in this halfspace h in time O(n), for some0<a<1. 


Exercise 11.20 Prove or disprove the following statement: Given three finite sets 
A,B,C of points in the plane, there is always a circle or a line that bisects A, B and 
C simultaneously (that is, no more than half of the points of each set are inside or 
outside the circle or on etther side of the line, respectively). 


11.8 3-Sum 


The 3-Sum problem is the following: Given a set S of n integers, does there exist a 
three-tuple?] of elements from S that sum up to zero? By testing all three-tuples this 
can obviously be solved in O(n?) time. If the tuples to be tested are picked a bit more 
cleverly, we obtain an O(n?) algorithm. 

Let (si,..., Sn) be the sequence of elements from S in increasing order. Then we test 
the tuples as follows. 


For i =1,... n { 
j=i k=n: 
While k >j { 
If sí; + sj + sk = 0 then exit with triple si, Sj, Sk- 
If si + sj +s, > 0 then k=k-— 1 else j =j+1. 
} 
} 


The runtime is clearly quadratic (initial sorting can be done in O(nlogn) time). 
Regarding the correctness observe that the following is an invariant that holds at the 
start of every iteration of the inner loop: si + sx +s, < 0, for alli < x < j, and 
Soa sj Sy 0, forall k< x 1. 


f and a polynomial p. 
3That is, an element of S may be chosen twice or even three times, although the latter makes sense for 
the number 0 only. :-) 
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Interestingly, this is essentially the best algorithm known for 3-Sum. It is widely 
believed that the problem cannot be solved in sub-quadratic time, but so far this has 
been proved in some very restricted models of computation only, such as the linear 
decision tree model [1]. 


11.9 3-Sum hardness 


There is a whole class of problems that are equivalent to 3-Sum up to sub-quadratic time 
reductions [2]; such problems are referred to as 3-Sum-hard. 


Definition 11.21 A problem P is 3-Sum-hard 2f and only if every instance of 3-Sum 
of size m can be solved using a constant number of instances of P—each of O(n) 
size—and o(n?) additional time. 


For instance, it is not hard to show that the following variation of 3-Sum—let us 
denote it by 3-Sum°—is 3-Sum hard: Given a set S of n integers, does there exist a 
three-element subset of S whose elements sum up to zero? 

As another example, consider the Problem GeomBase: Given n points on the three 
horizontal lines y = 0, y = 1, and y = 2, is there a non-horizontal line that contains at 
least three of them? 

3-Sum can be reduced to GeomBase as follows. For an instance S = [s;,...,Sn) of 
3-Sum, create an instance P of GeomBase in which for each si there are three points in 
P: (s;,0), (—s;/2,1), and (s;,2). If there are any three collinear points in P, there must 
be one from each of the lines y = 0, y = 1, and y = 2. So suppose that p = (s;,0), 
q = (=s;/2,1), and r = (s,,2) are collinear. The inverse slope of the line through p 


and q is =L > “i = —s;/2—s; and the inverse slope of the line through q and r is 
Setsj/? = sx +8;/2. The three points are collinear if and only if the two slopes are equal, 
that is, —sj/2 — si = sk +8;/2 ==> sits) +s, =0. 

A very similar problem is General Position, in which one is given n arbitrary points 
and has to decide whether any three are collinear. For an instance S of 3-Sum”, create 
an instance P of General Position by projecting the numbers s; onto the curve y = x’, 
that is, P = {(a, a) | a e S}. 

Suppose three of the points, say, (a, aî), (b,b*), and (c, c?) are collinear. This is the 
case if and only if the slopes of the lines through each pair of them are equal. (Observe 
that a, b, and c are pairwise distinct.) 


(b?—a®)/(b—a) = (c?—b*)/(c—b) = 
be+a?tab = c?+b?+be => 
b = (c?—a?)/(a—c) => 
b = —(a+c) => 
a+b+ce = 0. 
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Minimum Area Triangle is a strict generalization of General Position and, therefore, also 
3-Sum-hard. 

In Segment Splitting/Separation, we are given a set of n line segments and have to 
decide whether there exists a line that does not intersect any of the segments but splits 
them into two non-empty subsets. To show that this problem is 3-Sum-hard, we can 
use essentially the same reduction as for GeomBase, where we interpret the points along 
the three lines y = 0, y = 1, and y = 2 as sufficiently small “holes”. The parts of the 
lines that remain after punching these holes form the input segments for the Splitting 
problem. Horizontal splits can be prevented by putting constant size gadgets somewhere 
beyond the last holes, see the figure below. The set of input segments for the segment 


splitting problem requires sorting the points along each of the three horizontal lines, 
which can be done in O(nlogn) = o(n?) time. It remains to specify what “sufficiently 
small” means for the size of those holes. As all input numbers are integers, it is not 
hard to see that punching a hole of (x —1/4,x+1/4) around each input point x is small 
enough. 

In Segment Visibility, we are given a set S of n horizontal line segments and two 
segments si,S2 € S. The question is: Are there two points, pı € sı and pa € s2 which 
can see each other, that is, the open line segment pip2 does not intersect any segment 
from S? The reduction from 3-Sum is the same as for Segment Splitting, just put sı 
above and sz below the segments along the three lines. 

In Motion Planning, we are given a robot (line segment), some environment (modeled 
as a set of disjoint line segments), and a source and a target position. The question is: 
Can the robot move (by translation and rotation) from the source to the target position, 
without ever intersecting the “walls” of the environment? 

To show that Motion Planning is 3-Sum-hard, employ the reduction for Segment 
Splitting from above. The three “punched” lines form the doorway between two rooms, 
each modeled by a constant number of segments that cannot be split, similar to the 
boundary gadgets above. 'The source position is in one room, the target position in the 
other, and to get from source to target the robot has to pass through a sequence of three 
collinear holes in the door (suppose the doorway is sufficiently small compared to the 
length of the robot). 


Exercise 11.22 The 3-Sum’ problem is defined as follows: given three sets S,,S2,S3 
of n integers each, are there a; € Sı, az € S2, a3 € S3 such that a, + az + a3 = 0? 
Prove that the 3-Sum’ problem and the 3-Sum problem as defined in the lecture 
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(Sı = S2 = S3) are equivalent, more precisely, that they are reducible to each other 
in subquadratic time. 


Questions 


51. How can one construct an arrangement of lines in R?? Describe the incremen- 
tal algorithm and prove that its time complexity is quadratic in the number of lines 
(incl. statement and proof of the Zone Theorem). 


52. How can one test whether there are three collinear points 1n a set of n given 
points in R?? Describe an O(n?) time algorithm. 


53. How can one compute the minimum area triangle spanned by three out of n 
given points in R?? Describe an O(n?) time algorithm. 


54. What is a ham sandwich cut? Does it always exist? How to compute it? 
State and prove the theorem about the existence of a ham sandwich cut in R? and 
describe an O(n?) algorithm to compute it. 


55. What ts the endpoint visibility graph for a set of disjoint line segments in the 
plane and how can 1t be constructed? Give the definition and explain the relation 
to shortest paths. Describe the O(n?) algorithm by Welzl, including full proofs of 
Theorem |11.11)and Theorem (11.15 


56. Is there a subquadratic algorithm for General Position? Explain the term 
3-Sum hard and its implications and give the reduction from 3-Sum to General 
Position. 


57. Which problems are known to be 3-Sum-hard? List at least three problems 
(other than 3-Sum) and briefly sketch the corresponding reductions. 
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Davenport-Schinzel Sequences 


The complexity of a simple arrangement of n lines in R? is O(n?) and so every algorithm 
that uses such an arrangement explicitly needs O(n?) time. However, there are many 
scenarios in which we do not need the whole arrangement but only some part of it. For 
instance, to construct a ham sandwich cut for two sets of points in R? one needs the 
median levels of the two corresponding line arrangements only. As mentioned in the 
previous section, the relevant information about these levels can actually be obtained in 
linear time. Similarly, in a motion planning problem where the lines are considered as 
obstacles we are only interested in the cell of the arrangement we are located in. There 
is no way to ever reach any other cell, anyway. 

This chapter is concerned with analyzing the complexity—that is, the number of 
vertices and edges—of a single cell in an arrangement of n curves in R?. In case of a 
line arrangement this is mildly interesting only: Every cell is convex and any line can 
appear at most once along the cell boundary. On the other hand, it is easy to construct 
an example in which there is a cell C such that every line appears on the boundary OC. 

But when we consider arrangement of line segments rather than lines, the situation 
changes in a surprising way. Certainly a single segment can appear several times along 
the boundary of a cell, see the example in Figure Make a guess: What is the 
maximal complexity of a cell in an arrangement of n line segments in R?? 


ES 


Figure 12.1: A single cell in an arrangement of line segments. 
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You will find out the correct answer soon, although we will not prove it here. But 
my guess would be that it is rather unlikely that your guess is correct, unless, of course, 
you knew the answer already. :-) 

For a start we will focus on one particular cell of any arrangement that is very easy to 
describe: the lower envelope or, intuitively, everything that can be seen vertically from 
below. To analyze the complexity of lower envelopes we use a combinatorial descrip- 
tion using strings with forbidden subsequences, so-called Davenport-Schinzel sequences. 
These sequences are of independent interest, as they appear in a number of combinatorial 
problems and in the analysis of data structures [7]. The techniques used apply not 
only to lower envelopes but also to arbitrary cells of arrangements. 


Definition 12.1 An (n,s)-Davenport-Schinzel sequence, for n,s € IN, is a sequence over 
an alphabet A of size n in which 


e no two consecutive characters are the same and 


e there is no alternating subsequence of the form...a...b...a...b... of s+ 2 
characters, for any a,b E€ A. 


Let A (n) be the length of a longest (n, s)-Davenport-Schinzel sequence. 


For example, abcbacb is a (3, 4)-DS sequence but not a (3,3)-DS sequence because it 
contains the subsequence bcbcb. 


Proposition 12.2 A (m) +A,(n) <A,(m+n). 


Proof. On the left hand side, we consider two Davenport-Schinzel sequences, one over 
an alphabet A of size m and another over an alphabet B of size n. We may suppose that 
AB = / (for each character x € AM B introduce a new character x’ and replace all 
occurrences of x in the second sequence by x’). Concatenating both sequences yields a 
Davenport-Schinzel sequence over the alphabet A UB of size m+n. O 


Let us now see how Davenport-Schinzel sequences are connected to lower envelopes. 
Consider a collection F = {f,,..., fn} of real-valued continuous functions that are defined 
on a common interval I C R. The lower envelope L5 of F is defined as the pointwise 
minimum of the functions fi, 1 < i < n, over I. Suppose that any pair fi, fj, 1 <i< 
j < n, intersects in at most s points. Then I can be decomposed into a finite sequence 
I,,..., I, of (maximal connected) pieces on each of which a single function from F defines 
Lz. Define the sequence p(F) = (d1,..., Pg), where fy, is the function from F which 
defines £ on l.;. 


Observation 12.3 (F) is an (n, s)-Davenport-Schinzel sequence. 


In the case of line segments the above statement does not hold because a set of line 
segments is in general not defined on a common real interval. 
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Proposition 12.4 Let F be a collection of n real-valued continuous functions, each of 
which 1s defined on some real interval. If any two functions from F intersect in at 
most s points then p(F) is an (n,s + 2)-Davenport-Schinzel sequence. 


Proof. Let us first argue that we may suppose that the functions in F are piecewise 
quadratic. Denote by P the set of points that are vertices of the arrangement induced by 
the graphs of the functions in F. In other words, P is the set of all endpoints of functions 
and intersection points of two functions from F. Let x1,...,Xm denote the sequence of 
x-coordinates of points from P, sorted in increasing order. 

Consider the set $; C F of functions that are defined on the interval [x;, Xi+1], for 
i € {1,...,m — 1}. By definition of P, no two functions intersect within the interval 
(Xi, Xi41), that is, throughout (xi, Xi+41) the functions from F; maintain the same total 
(y-)order. We describe how to replace each function in $; on [xi,xi+1] by a quadratic 
function such that this order and therefore the combinatorics of the induced arrangement 
remains unchanged. 

Consider a function f € Fi. If f is the only function in $; that passes through both 
points (xi, f(xi)) and (x;¡+1, f(xi;1)), then replace f by the line segment connecting these 
two points. Otherwise, consider the set F of all functions in $; that pass through both 
points, and replace each function by a parabolic arc connecting the two points. These 
parabolic arcs can be chosen so that they are pairwise disjoint within the open interval 
(xi, Xi+1) and all of them are infinitesimally close to the line segment between (xi, f(x;)) 
and (x41, f(xi+41)), while maintaining the total order among the functions in F within the 
interval (x;,x;,1). The described replacement does not introduce any new intersection 
(that is why we used different parabolic arcs) and maintains the order of functions at 
any x-coordinate. In particular, p(F) remains the same. 

Now we are ready to prove the actual statement. The idea is to extend the func- 
tions to be defined on all of R, without changing (F) too much, and then resort to 
Observation [12.3 

Consider any function f € F defined on an interval [a, b]. Extend f to R using almost 
vertical rays pointing upward; from a use a ray of sufficiently small slope, from b use a 
ray of sufficiently large slope. For all functions use the same slope on these two extensions 
such that no extensions in the same direction intersect. By sufficiently small/large we 
mean that for any extension ray there is no endpoint of a function nor an intersection 
point of two functions in the open angular wedge bounded by the extension ray and the 
vertical ray starting from the same source. (There may be such points on the vertical 
ray, but not in the open wedge between the two rays.) As any two functions intersect 
only a finite number of times, there is only a finite number of endpoints and intersection 
points to consider, and so “sufficiently small/large” is well-defined. 

Denote the resulting collection of functions totally defined on R by F’. If the extension 
rays are sufficiently close to vertical then p(F’) = o(F). (Recall that by our reasoning 
from above we may regard each function as a parabolic arc or a line segment locally.) 

For any f € F’ a single extension ray can create at most one additional intersection 
with any g € F’. (Let las, bf] and [ag, bg] be the intervals on which the function f and 
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g, respectively, was defined originally. Consider the ray r extending f from as to the left. 
If as € [dg, bg] then r may create a new intersection with g, if af > bg then r creates a 
new intersection with the right extension of g from bg, and if ar < ay then r does not 
create any new intersection with g.) 

On the other hand, for any pair s,t of segments, neither the left extension of the 
leftmost segment endpoint nor the right extension of the rightmost segment endpoint 
can introduce an additional intersection. Therefore, any pair of segments in F’ intersects 
at most s + 2 times and the claim follows. O 


Next we will give an upper bound on the length of Davenport-Schinzel sequences for 
small s. 


Lemma 12.5 à (n) =n, (n) = 2n — 1, and Az(n) < 2n(1 + logn). 


Proof. Ai(n) = n is obvious. a(n) = 2n — 1 is given as an exercise. We prove 
A3(n) < 2n(1 + logn) = O(nlogn). 

For n = 1 it is A3(1) = 1 < 2. For n > 1 consider any (n, 3)-DS sequence o of length 
A3(n). Let a be a character that appears least frequently in o. Clearly a appears at 
most A3(n)/n times in ø. Delete all appearances of a from o to obtain a sequence o’ on 
n—1 symbols. But o” is not necessarily a DS sequence because there may be consecutive 
appearances of a character b in o”, in case that o=...bab.... 

Clam: There are at most two pairs of consecutive appearances of the same char- 
acter in o’. Indeed, such a pair can be created around the first and last appearance 
of a in o only. If any intermediate appearance of a creates a pair bb in o” then 
o=...a...bab...a..., in contradiction to o being an (n, 3)-DS sequence. 

Therefore, one can remove at most two characters from o” to obtain a (n — 1, 3)-DS 
sequence O. As the length of O is bounded by A3(n — 1), we obtain A3(n) < A3(n — 1) + 
A3(n)/n+ 2. Reformulating yields 


A A —1 2 1 
eu evita <1 42) 7=142Hn1 
n n-—1 n-1 ` œ~ — i 
>! > =f(1) i=l 
=:f(n) =f(n-1) 
and together with 2H,_; < 1+2logn we obtain A3(n) < 2n(1 + logn). O 
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Bounds for higher-order Davenport-Schinzel sequences. As we have seen, A(n) (no aba) 
and A2(n) (no abab) are both linear in n. It turns out that for s > 3, As(n) is slightly 
superlinear in n (taking s fixed). The bounds are known almost exactly, and they involve 
the inverse Ackermann function a(n), a function that grows extremely slowly. 

To define the inverse Ackermann function, we first define a hierarchy of functions 
X(N), (n), s(n), ... where, for every fixed k, x(n) grows much more slowly than 
ax-—1(n): 

We first let x,(n) = [n/2]. Then, for each k > 2, we define a (n) to be the number 
of times we must apply a,_1, starting from n, until we get a result not larger than 1. In 
other words, a(n) is defined recursively by: 


(n) 0, ifn < 1; 
xkn) = 
1+ a,(&.1(n)), otherwise. 


Thus, %2(n) = [logan], and a3(n) = log“ n. 

Now fix n, and consider the sequence «;(n), X2(n), s(n), .... For every fixed n, this 
sequence decreases rapidly until it settles at 3. We define a(n) (the inverse Ackermann 
function) as the function that, given n, returns the smallest k such that a (n) is at most 
3: 


a(n) = min {k | o (n) < 3). 


We leave as an exercise to show that for every fixed k we have a (n) = o(&k-ı(n)) 
and a(n) = 0(a;,(n)). 

Coming back to the bounds for Davenport-Schinzel sequences, for A3(n) (no ababa) 
it is known that A3(n) = O(na(n)) [4]. In fact it is known that A3(n) = 2na(n) + 
O(nya(n)) [5] [6]. For A4(n) (no ababab) we have à4(n) = O(n - 241) [3]. 

For higher-order sequences the known upper and lower bounds are almost tight, and 
they are of the form A,(n) = n - 2P°y(*(")) where the degree of the polynomial in the 
exponent is roughly s/2 [3] [6]. 


Realizing DS sequences as lower envelopes. There exists a construction of a set of n seg- 
ments in the plane whose lower-envelope sequence has length O(na(n)). (In fact, the 
lower-envelope sequence has length na(n) — O(n), with a leading coefficient of 1; it is 
an open problem to get a leading coefficient of 2, or prove that this is not possible.) 

It is an open problem to construct a set of n parabolic arcs in the plane whose 
lower-envelope sequence has length O(n - 2%), 


Generalizations of DS sequences. Also generalizations of Davenport-Schinzel sequences 
have been studied, for instance, where arbitrary subsequences (not necessarily an al- 
ternating pattern) are forbidden. For a word o and n € IN define Ex(o,n) to be the 
maximum length of a word over A = {1,...,n}* that does not contain a subsequence of 
the form o. For example, Ex(ababa,n) = A3(n). If o consists of two letters only, say a 
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and b, then Ex(o, n) is super-linear if and only if ø contains ababa as a subsequence [1]. 
This highlights that the alternating forbidden pattern is of particular interest. 


Exercise 12.6 Prove that A2(n) = 2n—1. 


Exercise 12.7 Prove that A,(n) is finite for all s andn. 


12.1 Constructing lower envelopes 


Theorem 12.8 Let F = {f,,...,fn} be a collection of real-valued continuous functions 
defined on a common interval I C R such that no two functions from F intersect in 
more than s points. Then the lower envelope Ly can be constructed in O(A,;(n) log n) 
time. (Assuming that an intersection between any two functions can be constructed 
in constant time.) 


Proof. Divide and conquer. For simplicity, assume that n is a power of two. Split F 
into two equal parts F, and F and construct Ly, and Ly, recursively. The resulting 
envelopes can be merged using line sweep by processing 2A,(n/2)+A,(n) < 2A,(n) events 
(the inequality 2A,(n/2) < As(n) is by Proposition |12.2). Here the first term accounts 
for events generated by the vertices of the two envelopes to be merged. The second 
term accounts for their intersections, each of which generates a vertex of the resulting 
envelope. Observe that no sorting is required and the SLS structure is of constant size. 
Therefore, the sweep can be done in time linear in the number of events. 

This yields the following recursion for the runtime T(n) of the algorithm. T(n) < 
2T(n/2) + cAs(n), for some constant c € IN. Using Proposition it follows that 
T(n) < cD Pep As (n/2!) < cP PEP As(n) = O(As(n) logn). O 


Exercise 12.9 Show that every (n, s)-Davenport-Schinzel sequence can be realized as 
the lower envelope of n continuous functions from R to R, every pair of which 
intersect at most s times. 


Exercise 12.10 Show that every Davenport-Schinzel sequence of order two can be re- 
alized as a lower envelope of n parabolas. 


It is a direct consequence of the symmetry in the definition that the property of being 
a Davenport-Schinzel sequence is invariant under permutations of the alphabet. For 
instance, o = bcacba is a (3, 3)-DS sequence over A = {a,b,c}. Hence the permutation 
n = (ab) induces a (3, 3)-DS sequence n(o) = acbcab and similarly n’ = (cba) induces 
another (3,3)-DS sequence 7t’(o) = abcbac. 

When counting the number of Davenport-Schinzel sequences of a certain type we 
want to count essentially distinct sequences only. Therefore we call two sequences over 
a common alphabet A equzvalent if and only if one can be obtained from the other 
by a permutation of A. Then two sequences are distinct if and only if they are not 
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equivalent. A typical way to select a representative from each equivalence class is to 
order the alphabet and demand that the first appearance of a symbol in the sequence 
follows that order. For example, ordering A = {a, b,c} alphabetically demands that the 
first occurrence of a precedes the first occurrence of b, which in turn precedes the first 
occurrence of c. 


Exercise 12.11 Let P be a convex polygon with n+1 vertices. Find a bijection between 
the triangulations of P and the set of pairwise distinct (n,2)-Davenport-Schinzel 
sequences of maximum length (2n — 1). It follows that the number of distinct 
maximum (n, 2)-Davenport-Schinzel sequences 1s exactly Cy_1 = an?) which 1s 


nin-—1 
the (n — 1)-st Catalan number. 


Questions 


58. What is an (n,s) Davenport-Schinzel sequence and how does it relate to the 
lower envelope of real-valued continuous functions? Give the precise definition 
and some examples. Explain the relationship to lower envelopes and how to apply 
the machinery to partial functions like line segments. 


59. What is the value of ^ (n) and A2(n)? 
60. What is the asymptotic value of A3(n), Aa(n), and A,(n) for larger s? 


61. What is the combinatorial complexity of the lower envelope of a set of n 
lines/parabolas/line segments? 
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Chapter 13 


Epsilon Nets 


13.1 Motivation 


Here is our scenario for this chapter. We are given a set A of points in Rt and a family 
R of ranges r C R4, for example the set of all balls, halfspaces, or convex sets in Rê. 
A is huge and probably not even known completely; similarly, R may not be accessible 
explicitly (in the examples above, it is an uncountable set). Still, we want to learn 
something about A and some r € R. 

The situation is familiar, definitely, if we don’t insist on the geometric setting. For 
example, let A be the set of consumers living in Switzerland, and let Y be the subset 
of consumers who frequently eat a certain food product, say Lindt chocolate. We have 
similar subsets for other food products, and together, they form the family of ranges R. 

If we want to learn something about f, e.g. the ratio a (the fraction of consumers 
frequently eating Lindt chocolate), then we typically sample a subset S of A and see 
what portion of S lies in Y. We want to believe that 


Bie! approximates Jl 
IS] Al’ 


and statistics tells us to what extent this is justified. In fact, consumer surveys are based 
on this approach: in our example, S is a sample of consumers who are being asked about 
their chocolate preferences. After this, the quantity |? N S|/|S| is known and used to 
predict the “popularity” |7|/|A| of Lindt chocolate among Swiss consumers. 

In this chapter, we consider a different kind of approximation. Suppose that we 
are interested in the most popular food products in Switzerland, the ones which are 
frequently eaten by more than an e-fraction of all consumers, for some fixed 0 < e < 1. 
The goal is to find a small subset N of consumers that “represent” all popular products. 
Formally, we want to find a set N C A such that 


for all r: > = rin Au 
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Such a subset is called an epsilon net. Obviously, N = A is an epsilon net for all e, but 
as already mentioned above, the point here is to have a small set N. 

Epsilon nets are very useful in many contexts that we won't discuss here. But al- 
ready in the food consumption example above, it is clear that a small representative 
set of consumers is a good thing to have; for example if you quickly need a statement 
about a particular popular food product, you know that you will find somebody in your 
representative set who knows the product. 

The material of this chapter is classic and goes back to Haussler and Welz! [1]. 


13.2 Range spaces and e-nets. 


Here is the formal framework. Let X be a (possibly infinite) set and R C 2X. The pair 
(X, R) is called a range space" | with X its points and the elements of R its ranges. 


Definition 13.1 Let (X,R) be a range space. Given AC X, finite, ande €R,0O<e<l, 
a subset N of A is called an e-net of A (w.r.t. R) if 


forallreR: IrnmAl>elAl => rnNxX0/. 


This definition is easy to write down, but it is not so easy to grasp, and this is why we 
will go through a couple of examples below. Note that we have a slightly more general 
setup here, compared to the motivating Section where we had X = A. 


13.2.1 Examples 


Typical examples of range spaces in our geometric context are 


e (R,H,) with H := [(—oo, alla € R)U (la, oo)|a € R} (half-infinite intervals), 
and 


e (R,J) with J := (la, b]|| a,b € R, a < b} (intervals), 
and higher-dimensional counter-parts 
e (R4, Ka) with Ha the closed halfspaces in R* bounded by hyperplanes, 
e (Rt, Ba) with Ba the closed balls in R4, 
e (Rt, Sa) with Sa the d-dimensional simplices in Rt, and 


e (Rt, Ca) with Ca the convez sets in Rê. 


lIn order to avoid confusion: A range space is nothing else but a set system, sometimes also called 
hypergraph. It is the context, where we think of X as points and R as ranges in some geometric ambient 
space, that suggests the name at hand. 
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e-Nets w.r.t. (R, Kı) are particularly simple to obtain. For A C R, N := (min A, max A} 
is an e-net for every e—it is even a O-net. That is, there are e-nets of size 2, independent 
from |A| and e. 

The situation gets slightly more interesting for the range space (IR, J) with intervals. 
Given e and A with elements 


Qai <A <: <An, 
we observe that an e-net must contain at least one element from any contiguous sequence 


(ai, Aiy1,+--., Gizk-1) Of k > en (ie. k > |en] + 1) elements in A. In fact, this is a 
necessary and sufficient condition for e-nets w.r.t. intervals. Hence, 


LA eng Dspen jaa) 


is an e-net of ‘al || < [>] 1. So while the size of the net depends now on e, 
it is still independent of |A]. 


13.2.2 No point in a large range. 


Let us start with a simple exercise, showing that large ranges are easy to “catch”. Assume 
that |r Al > e|A| for some fixed r and e, 0<e€<l. 

Now consider the set S obtained by drawing s elements uniformly at random from A 
(with replacement). We write 


S~ AS, 


indicating that S is chosen uniformly at random from the set A* of s-element sequences 
over A. 
What is the probability that S ~ AS fails to intersect with r, ie. SM r= (? For 


— OA (note p > £) we get) 


prob(S Nr =) = (1— p) < (1— €) < e™®5 . 


That is, if s = 1 then this probability is at most e 1 ~ 0.368, and if we choose s = Al, 
then this probability decreases exponentially with A: It is at most e~*. 
For example, if |A] = 10000 and |r N A| > 100 (r contains more than 1% of the points 


in A), then a sample of 300 points is disjoint from r with probability at most e”? ~ 0.05. 


2The number L of elements in the set is the largest £ such that ((|en] +1) <n, hence L = la] 
< +, and so L < lie. L< [2] —1. 


Since |en] +1 > en, we have ja a 


3We make use of the inequality 1 + x < e* for all x € R. 
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13.2.3 Smallest enclosing balls. 


Here is a potential use of this for a geometric problem. Suppose A is a set of n points 
in R*, and we want to compute the smallest enclosing ball of A. In fact, we are willing 
to accept some mistake, in that, for some given e, we want a small ball that contains all 
but at most en points from A. So let's choose a sample S of Al points drawn uniformly 
(with replacement) from A and compute the smallest enclosing ball B of S. Now let 
r := R4 \ B, the complement of B in R4, play the role of the range in the analysis above. 
Obviously r N S = 0, so it is unlikely that |r N A| > |A|, since—if so—the probability of 


SNr = Í was at most e. 


It 1s important to understand that this was complete nonsense! 


For the probabilistic analysis above we have to first choose r and then draw the 
sample—and not, as done in the smallest ball example, first draw the sample and then 
choose r based on the sample. That cannot possibly work, since we could always choose 
r simply as the complement Rt \ S—then clearly r N S = @ and |r N A] > e|Al, unless 
|S] > (1 — £)|Al. 

While you hopefully agree on this, you might find the counterargument with r = RUS 
somewhat artificial, e.g. complements of balls cannot be that selective in ‘extracting’ 
points from A. It is exactly the purpose of this chapter to understand to what extent 
this advocated intuition is justified or not. 


13.3 Either almost all is needed or a constant suffices. 


Let us reveal the spectrum of possibilities right away, although its proof will have to 
await some preparatory steps. 


Theorem 13.2 Let (X, R) be an infinite range space. Then one of the following two 
statements holds. 


(1) For every n € N there is a set A, C X with |A,| = n such that for every e, 
0O<e<l1, an e-net must have size at least (1—€)n. 


(2) There is a constant 5 depending on (X,R), such that for every finite A C X 
and every £, O < e < 1, there ts an e-net of A w.r.t. R of size at most 85 log, 48 
(independent of the size of A). 


That is, either we have always e-nets of size independent of |A|, or we have to do the 
trivial thing, namely choosing all but en points for an e-net. Obviously, the range spaces 
(R, Hı) and (RR, J) fall into category (2) of the theorem. 

For an example for (1), consider (IR?, C,). For any n € N, let An be a set of n points 
in convex position. For every N C An there is a range r € C, namely the convex hull 
of An \ N, such that An Nr = An \ N (hence, rm N = ()); see Figure Therefore, 
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N CA, cannot be an e-net of An w.r.t. Co if |A, A N| =n—|N| > en. Consequently, an 
e-net must contain at least n — en =(1—e)n points] 


Figure 13.1: If R consists of all convex sets in the plane, then only trivial epsilon nets 
exist: for every subset N (black points) of a set An in convex position, 
the range r =conv(A, \ N) fails to intersect N. 


So what distinguishes (IR?, C2) from (R, Hı) and (R,J)? And which of the two cases 
applies to the many other range spaces we have listed above? Will all of this eventually 
tell us something about our attempt of computing a small ball covering all but at most 
en out of n given points? This and more should be clear by the end of this chapter. 


13.4 What makes the difference: VC-dimension 
Given a range space (X, R) and A C X, we let 
Ria ={rNnAlre R}, 


the projection of R to A. Even if R is infinite, R|, is always of size at most 2" if A 
is an n-element set. The significance of projections in our context becomes clear if we 
rewrite Definition in terms of projections: N C A is an e-net if 


forallre Rla: Irl>elAl => rnaNzZ0. 


All of a sudden, the conditions for an e-net have become discrete, and they only depend 
on the finite range space (A, Rl, ). 


“If we were satisfied with any abstract example for category (1), we could have taken (X, 2*) for any 
infinite set X. 
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Note that, for A a set of n points in convex position in the plane, Cal, = 2%; we 
get every subset of A by an intersection with a convex set (this is also the message of 
Figure [13.1). That is | Co], | = 2", the highest possible value. 

For A a set of n points in R, we can easily see that] | Ma l= ue +1=O(n?). A 
similar argument shows that | H,|, | = 2n. Now comes the crucial definition. 


Definition 13.3 Given a range space (X,R), a subset A of X is shattered by R if Ri, = 
2^. The VC-dimensior?] of (X, R), VCdim(X,R), is the cardinality (possibly infinite) 
of the largest subset of X that is shattered by R. If no set is shattered (i.e. not even 
the empty set which means that R is empty), we set the VC-dimension to —1. 


We had just convinced ourselves that (IR?, C2) has arbitrarily large sets that can be 
shattered. Therefore, VCdim(IR?, C2) = oo. 

Consider now (R,J). Two points A = {a,b} can be shattered, since for each of the 
4 subsets, Ø, {a}, {b}, and {a, b}, of A, there is an interval that generates that subset by 
intersection with A. However, for A = {a,b,c} with a < b < c there is no interval that 
contains a and c but not b. Hence, VCdim(R,J) = 2. 


Exercise 13.4 What 1s VCdim(R, H1)? 


Exercise 13.5 Prove that if VCdim(X,R) = œ, then we are in case (1) of Theo- 
rem|13.2, meaning that only trivial epsilon nets always exist. 


13.4.1 The size of projections for finite VC-dimension. 


Here is the (for our purposes) most important consequence of finite VC dimension: there 
are only polynomially many ranges in every projection. 


Lemma 13.6 (Sauer’s Lemma) Jf (X, R) is a range space of finite VC-dimension at most 
5, then 


5 
Ral < Os(n) = Y" (+) 


i=0 


for all AC X with |A| =n. 


5Given A as a1 < ag--: < an we can choose another n+ 1 points bi, O < i < n, such that 
bo < ay < bı < a2 < b2 < :-:bn—1 < An < bn . 
Each nonempty intersection of A with an interval can be uniquely written as AN [b;, b;] for0 Si <j <n. 
This gives To plus one for the empty set. 


&VC’ in honor of the Russian statisticians V. N. Vapnik and A. Ya. Chervonenkis, who discovered the 
crucial role of this parameter in the late sixties. 
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Proof. First let us observe that ® : No U{—1} x No > No is defined by the recurrencg”] 


0 el 
Os(n)=4<4 1 n=0and ô > 0, and 
Os(n—1)+ 05 ¡(n—1) otherwise. 


Second, we note that the VC-dimension cannot increase by passing from (X, R) to a 
projection (A, R), R:= R|,. Hence, it suffices to consider the finite range space (A, R)— 
which is of VC-dimension at most ¿—and show |R| < Os(n) (since ® is monotone in 
5). 

Now we proceed to a proof by induction of this inequality. If A = Ø or R = @ the 
statement is trivial. Otherwise, we consider the two ‘derived’ range spaces for some fixed 
xEA: 

(A \{x},R—x), with R—x:={r\{x}|r € R} 


(note R—x = Rla(,,) and 
(A\{x}R™), with R™ := {r € R|x ¢1,rU {x} e R} 


Observe that the ranges in R™ are exactly those ranges in R—x that have two preimages 
under the map 
R> ro r\{x} €R-x, 


all other ranges have a unique preimage. Consequently, 
IR] = IR — x| + IR]. 


We have |R—x| < Os(n—1). IfA’ C A\{x} is shattered by R™ , then A’U{x} is shattered 
by R. Hence, (A \ {x}, R) has VC-dimension at most ¿— 1 and |R")| < ®5—ı(n — 1). 
Summing up, it follows that 


R| < Os(n — 1) + 05_¡(n— 1) = 05(n) 
which yields the assertion of the lemma. O 


In order to see that the bound given in the lemma is tight, consider the range space 


Luo) 


Obviously, a set of more than 5 elements cannot be shattered (hence, the VC-dimension is 
at most 5), and for any finite A C X, the projection of the ranges to A is Cae (2) —with 
cardinality @;(|A|). 


“We recall that the binomial coefficients (3) (with k,n € No) satisfy the recurrence (9) =0ifn<k, 
(5) = 1, and (p) = (1,0) + (Ki): 
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We note that a rough, but for our purposes good enough estimate for @ is given byf] 
O;(n) <n° for 5 >2. 


We have seen now that the maximum possible size of projections either grows expo- 
nentially (2” in case of infinite VC-dimension) or it is bounded by a polynomial n° in 
case of finite VC-dimension ô). The latter is the key to the existence of small e-nets. 
Before shedding light on this, let us better understand when the VC-dimension is finite. 


13.5 VC-dimension of Geometric Range Spaces 


13.5.1 Halfspaces. 


Let us investigate the VC-dimension of (IR?, HK,). It is easily seen that three points in the 
plane can be shattered by halfplanes, as long a they do not lie on a common line. Hence, 
the VC-dimension is at least 3. Now consider 4 points. If three of them lie on a common 
line, there is no way to separate the middle point on this line from the other two by a 
halfplane. So let us assume that no three points lie on a line. Hither three of them are 
vertices of a triangle that contains the fourth point—then we cannot possibly separate 
the fourth point from the remaining three points by a halfplane. Or the four points are 
vertices of a convex quadrilateral—then there is no way of separating the endpoints from 
a diagonal from the other two points. Consequently, four points cannot be shattered, 
and VCdim(R?, K2) = 3 is established. 

The above argument gets tedious in higher dimensions, if it works in a rigorous way 
at all. Fortunately, we can employ a classic: By Radon’s Lemma (Theorem [3.8), every 
set A of > d +2 points in Rt can be partitioned into two disjoint subsets A, and Az 
such that conv(A,;) N conv(A2) 4 Ø. We get, as an easy implication, that a set A of 
at least d + 2 points in Rt cannot be shattered by halfspaces. Indeed, let A, U A3 
be a partition as guaranteed by Radon’s Lemma. Now every halfspace containing A, 
must contain at least one point of Az, hence hN A = A, is impossible for a halfspace 
h and thus A is not shattered by Ha. Moreover, it is easily seen that the vertex set of 
a d-dimensional simplex (there are d+ 1 vertices) can be shattered by halfspaces (each 
subset of the vertices forms a face of the simplex and can thus be separated from the 
rest by a hyperplane). We summarize that 


VCdim(R*, Ha) =d+1. 


Let us now consider the range space (R$, Ka), where Ha denotes the set of all closed 
halfspaces below non-vertical hyperplanes?we call these lower halfspaces. Since Ha C 


8A better estimate, at least for 5 > 3, is given by Ds(n) < pa for aln,deN, d<n. 
°A hyperplane is called non-vertical if it can be specified by a linear equation = 4 Aixi = Aa+1 With 


Aa Æ 0; see also Section [1.2] 
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Ha, the VC-dimension of (R4, Ha) is at most d + 1, but, in fact, it not too difficult to 
show 


VCdim(R*, Ha) =d. (13.7) 


(Check this claim at least for d = 2.) This range space is a geometric example where the 
bound of Sauer’s Lemma is attained. Indeed, for any set A of n points in Rt in general 
position >| it can be shown that 


[žala] = Palm): 


13.5.2 Balls. 


It is easy to convince oneself that the VC-dimension of disks in the plane is 3: Three 
points not on a line can be shattered and four points cannot. Obviously not, if one of the 
points is in the convex hull of the other, and for four vertices of a convex quadrilateral, 
it is not possible for both diagonals to be separated from the endpoints of the respective 
other diagonal by a circle (if you try to draw a picture of this, you see that you get two 
circles that intersect four times, which we know is not be the case). 

A more rigorous argument which works in all dimensions is looming with the help of 
(13.7), if we employ the following transformation called lifting map that we have already 
encountered for d = 2 in Section 


Rå —s Rått 
(iy X2,- --, Xa) =P > L(p) =(x1,X2,...,Xa, X17 + X2 +--+ Exa”) 


(For a geometric interpretation, this is a vertical projection of R4 to the unit paraboloid 
Xai = X1? + Xo? +++: + xa? in R¢*!.) The remarkable property of this transformation 
is that it maps balls in Rt to halfspaces in R“** in the following sense. 

Consider a ball Ba(c, p) (c = (c1, C2,-..,Ca) E€ R4 the center, and p € R+ the radius). 
A point p = (x1,X2,...,Xa) lies in this ball if and only if 


d 
> a —&)? < p? e > (ee — 2xici + ci?) < p? 
i=l 


i=1 
d d 
> Y ac) +(x +x t exa) <p? Y c?; 
i=1 i=1 


this equivalently means that ((p) lies below the non-vertical hyperplane (in R+!) 


d+1 
y hix; = haz | with 


i=1 


h = h(c, p) = $ Ene 


d 
(ha, ha, ... ha, ha+1; ha+2) = (2e (—2c2), ...) (—2ca), t p? = E 00) . 
i=l 


10No i+ 2 on a common i-flat for i € (1,2,...,d— 1); in particular, no d + 1 points on a common 
hyperplane. 
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It follows that a set A C Rê is shattered by Ba (the set of closed balls in Ra) if and only 
if (A) :=(L(p) |p € A} is shattered by Hay1. Assuming (13.7), this readily yields 


VCdim(R4, Ba) =d+1. 


The lifting map we have employed here is a special case of a more general paradigm 
called linearization which maps non-linear conditions to linear conditions in higher 
dimensions. 

We have clarified the VC-dimension for all examples of range spaces that we have 
listed in Section except for the one involving simplices. The following exercise 
should help to provide an answer. 


Exercise 13.8 For a range space (X, R) of VC-dimension d and a number k € N, 
consider the range space (X,J) where ranges form intersections of at most k ranges 
from R, that is, J = qe Ri |Vi : Ri € R}. Give an upper bound for the VC- 
dimension of (X, J). 


13.6 Small e-Nets, an Easy Warm-up Version 


Let us prove a first bound on the size of e-nets when the VC-dimension is finite. 


Theorem 13.9 Let n € IN and (X,R) be a range space of VC-dimension d > 2. Then 
for any A C X with |A| =n and any e € R* there exists an e-net N of A w.r.t. R 
with |N| < pen], 
Proof. We restrict our attention to the finite projected range space (A,R), R := Rla, 
for which we know |R| < Da(n) < n°. It suffices to show that there is a set N C A with 
IN| < alan which contains an element from each r € R; := {r € R: |r| > en). 

Suppose, for some s € N (to be determined), we let N ~ A*. For each r € R¿, we 
know that prob(r N N = Ø) < (1— e)? < e **. Therefore, 


prob(N is not e-net of A) = prob(3reR.:rnmN =f) 
= prob( V (ra N =0)) 


TER 


< ys prob(r N N =@) < [R¿Je”** < nde7*s . 
TER, 


It follows that if s is chosen so that n“e ** < 1, then prob(N is not e-net of A) < 1 and 
there remains a positive probability for the event that N is an e-net of A. Now 


nte <1 eo ni<e® a dinn<es. 
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That is, for s = p22"), the probability of obtaining an e-net is positive, and therefore 
an e-net of that size has to exist £] O 


If we are willing to invest a little more in the size of the random sample N, then the 
probability of being an e-net grows dramatically. More specifically, for s = p Lenta, we 
have 


and, therefore, a sample of that size is an e-net with probability at least 1—e~. 


We realize that we need dian sample size to compensate for the (at most) nt subsets 
of A which we have to hit—it suffices to ensure positive success probability. The extra 
A allows us to boost the success probability. 

Also note that if A were shattered by R, then R = 2^ and |R| = 2". Using this bound 
instead of nt in the proof above would require us to choose s to be roughly 2292 a 


E ? 
useless estimate which even exceeds n unless e is large (at least In 2 ~ 0.69). 


13.6.1 Smallest enclosing balls, again 


It is time to rehabilitate ourselves a bit concerning the suggested procedure for computing 
a small ball containing all but at most e|A| points from an n-point set A CIR. 

Let (R4, Ba") be the range space whose ranges consist of all complements of closed 
balls in Rt. This has the same VC-dimension d+1 as (Rt, Ba). Indeed, if ANT = A’ for 
a ball r, then AN (R*\r) =A\A’, so A is shattered by Ba if and only if A is shattered 
by ge 

Hence, if we choose a sample N of size s = A 
at least 1 — e this is an e-net for A w.r.t. B¿*P. Let us quickly recall what this 
means: whenever the complement of some ball B has empty intersection with N, then 
this complement contains at most an e|A| points of A. As a consequence, the smallest 


ball enclosing N has at most ¢|A| points of A outside, with probability at least 1 — e™^. 


pee ey then, with probability 


13.7 Even Smaller -Nets 


We still need to prove part 2 of Theorem the existence of e-nets whose size is 
independent of A. For this, we employ the same strategy as in the previous section, 1.e. 
we sample elements from A uniformly at random, with replacement; a refined analysis 
will show that—compared to the bound of Theorem [13.9much less elements suffice. 
Here is the main technical lemma. 


11 This line of argument “If an experiment produces a certain object with positive probability, then it has 
to exist”, as trivial as it is, admittedly needs some time to digest. It is called The Probabilistic Method, 
and was used and developed to an amazing extent by the famous Hungarian mathematician Paul Erdós 
starting in the thirties. 


167 


Chapter 13. Epsilon Nets CG 2013 


Lemma 13.10 Let (X,R) be a range space of VC-dimension 5 > 2, and let A C X be 
finite. Ifx ~A™ form > 8/e, then for the set Nx of elements occurring in x, we 
have 


em 


prob(N, zs not an e-net for A w.r.t. R) <205(2m)2 2 . 


Before we prove this, let us derive the bound of Theorem from it. Let us recall 
the statement. 


Theorem (2). Let (X, R) be a range space with VC-dimension 5 > 2. Then for every 
finite A C X and every e, 0 < e < 1, there is an e-net of A w.r.t. R of size at most 
88 log, a8 (independent of the size of A). 
We want that 
20;(2m)2-2 <1, 

since then we know that an e-net of size m exists. We have 

20;(2m)2 77 
<= 22m)? <27 


<1 


E 
+ 1+8log,(2m) < = 


= 2log, m < a 
45 m 


=> < : 
e€ logam 


In the second to last implication, we have used ô > 2 and m > 8. Now we claim that 


the latter inequality is satisfied for m = 22 log, 2. To see this, we need that um >x 


for m = 2a log, x and a = s, We compute 


m 2a log, x 2a log, x 2 log, x 


= = = > 
logm  log,(2a1lo0g, a) 1+ log, ax + log, log, « i + log, x + log, log, x a 
as long as 


log, xa > 1 + log, log, x > log, a > log, 2log, x = a > 2log, a, 


which holds as long as «œ > 2, and this is satisfied for ax = s, 


Proof. (Lemma|13.10) By going to the projection (A, R|, ), we may assume that X = A, 
so we have a finite range space over n elements (see Section |13.4). Fix e € R*. For 
t € N, a range r € R and x € At (a t-vector of elements from A), we define 


count(r, x) = Hi € [t]: xi € r}. 
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Now we consider two events over A?™. The first one is the bad event of not getting 
an e-net when we choose m elements at random from A (plus another m elements that 
we ignore for the moment): 


Q := {xy E Arm | xEA™ y E€ A™, Jr E€ Re : count(r,x) = o}. 
Recall that R, = {r € R : |r| > en}. Thus prob(Q) = prob(N, is not e-net) which is 


exactly what we want to bound. 
The second auxiliary event looks somewhat weird at first: 


Ji= {xy c A?™ E EA™, y E€ A™, Jr € R¿:count(r,x) = 0 and count(r, y) > >} f 
This event satisfies J C Q and contains pairs of sequences x and y with somewhat 
contradicting properties for some r. While the first part x fails to contain any element 


from r, the second part y has many elements from r. 


Claim 1. prob(J) < prob(Q) < 2prob(J). 


The first inequality is a consequence of J C Q. To prove the second inequality, we 


show that ” 

prob(J) _ probJNQ) _ srob(J 10) > L. 

prob(Q)  prob(Q) 2 
So suppose that xy € Q, with “witness” r, meaning that r € R. and count(r,x) = 0. 
We show that xy € J with probability at least 1/2, for every fixed such x and y chosen 
randomly from A™. This entails the claim. 

The random variable count(r, y) is a sum of m independent Bernoulli experiments 
with success probability p := |r|/n > e (thus expectation p and variance p(1 — p)). 
Using linearity of expectation and variance (the latter requires independence of the ex- 
periments), we get 


E(count(r,y)) = pm, 
Var(count(r,y)) = p(1—p)m. 


Now we use Chebyshew's inequality?] to bound the probability of the “bad” event that 
count(r,y) < E. We have 


prob (count(r, y) < m 


prob (count(r, y) < >) 


IN 


IN 


prob (Icount(r, y) — E(count(r, y))| > 5) 


Il 


ap a y) ) 


—p1? zZ 
2 4(1—p)? _ 4(1 Pet 
p(1—pjm pm pm em 2 


prob (Icount(r,y — E(count(r, y))| > 


prob(|X — E(X)| > kVar(X)) < EVH 
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. Hence 


since m > = 


N| = 


prob (count(r, y) > =) > 


and the claim is proved. 

Now the weird event J reveals its significance: we can nicely bound its probability. 
The idea is this: We enumerate A as A = {aj, d2,..., adn) and let the type of z € At be 
the n-sequence 


type(z) = (count(a,,z), count(az, z), ...,count(an, z)). 


For example, the type of z = (1,2, 4,2,2,4) w.r.t. A = {1, 2,3, 4} is type(z) = (1,3,0,2). 

Now fix an arbitrary type t. We will bound the conditional probability prob(xy € J | 
xy has type T), and the value that we get is independent of t. It follows that the same 
value also bounds prob(J). 


Claim 2. prob(xy € J | xy has type t) < 0s(2m)2 "Y", 


To analyze the probability in question, we need to sample z = xy uniformly at random 
from all sequences of type t. This can be done as follows: take an arbitrary sequence z’ 
of type t, and then apply a random permutation 7 € S2,, to obtain z = Tt(z'), meaning 
that 

Zi = Zap) for all i. 


Why does this work? First of all, 7 preserves the type, and it is easy to see that all 
sequences z of type T can be obtained in this way. Now we simply count the number of 
permutations that map z’ to a fixed z and see that this number only depends on T, so 
it is the same for all z. Indeed, for every element a; € A, there are ti! many ways of 
mapping the a;'s in z’ to the a;'s in z. The number of permutations that map z’ to z is 


therefore given by 
n 
I] Til. 
i=1 


By these considerations, 
prob(xy € J | xy has type 1) = prob(rr(z') € J). 


To estimate this, we let S be the set of distinct elements in z’ (this is also a function 
of the type T). Since (X, R) has VC-dimension 5, we know from Lemma that 
[Ri | < Os(2m), so at most that many different subsets T of S can be obtained by 
intersections with ranges r € R, and in particular with ranges r € R¿. 

Now we look at some fixed such T, consider a permutation 7 and write 7(z’) = xy. 
We call T a witness for n if no element of x is in T, but at least em/2 elements of y are 
in T. According to this definition, 


mz’')€J <= has some witness T C S. 
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By the union bound, 


prob(7(z’) € J) = prob(AT: T is a witness for 71) < a prob(T is a witness for 71). 
T 


Suppose that z’ contains l > em/2 occurences of elements of T (for smaller £, T 
cannot be a witness). In order for m(z’) = xy to be in J, no element of T can appear in 
the first half x, but all occurrences of elements from T must fall into the second half y. 
Therefore, the probability of T being a witness for a random permutation is 


(7)  m(m-1)--:(m—1+1) a 
ES memo) amotar] ©? <2 2, 


since among all the ‘ee equally likely ways in which 7 distributes the £ occurences in 


z, exactly the (7) equally likely ways of putting them into y are good!*] Summing up 
over all at most ®5;(2m) sets T, we get 


em 


prob(xy € J | xy has type tT) < Os(2m)2 2, 


for all 7. 
The proof is finished by combining the two claims: 


em 
2 


prob(N, is not e-net) = prob(Q) < 2prob(J) < 205(2m)27 


Questions 


62. What ts a range space? Give a definition and a few examples. 


63. What ts an epsilon net? Provide a formal definition, and explain in words what 
it means. 


64. What is the VC dimension of a range space Give a definition, and compute the 
VC dimension of a few example range spaces. 


65. Which range spaces always have small epsilon nets (and how small), and 
which ones don’t? Explain Theorem [13.2] 


66. How can you compute small epsilon nets? Explain the general idea, and give 
the analysis behind the weaker bound of Theorem 


67. How can you compute smaller epsilon nets? Sketch the main steps of the proof 
of the stronger bound of Theorem 


13This argument can be made more formal by explicitly counting the permutations that map (1,2,...,£) 
to the set {1,2,...,m}. We leave this to the reader. 
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